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PREFACE 


This book attempts to communicate the central ideas and results of 
game theory and related decision-making models unencumbered by their 
technical mathematical details: thus, for example, almost no proofs are 
included. It is a book about game theory, not a presentation of the 
theory itself. By laying bare the main structure of the theory—its assump- 
tions and conclusions, its deficiencies and aspirations—we hope that the 
book will serve as a useful critical introduction to the theory and a guide 
to the literature. We have tried, on the one hand, to make sufficiently 
precise statements so that misunderstandings and misstatements will not 
result from a reading of the book, but, on the other hand, we have striven 
to keep the language and notation sufficiently familiar and simple that 
there will be scientists who will benefit from it who would have found a 
treatise on game theory unintelligible. There are many mathematicians, 
even among those sympathetic to social science applications, who feel that 
these goals are incompatible, and we cannot deny having reached times 
of despondency when we were ready to agree. 

In many ways the overall outline of this book parallels the original 


structuring given to the theory by von Neumann and Morgenstern in 
Theory of Games and Economic Behavior [1944, 1947], 


: cs bh but in detail it is 
different: First, in the decade since the second edition 


of their book there 
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If we have not failed completely, then there 


made upon the reader. ae 
should be something of interest here for a wide group of scholars: econo- 


mists concerned with economic theory, political scientists and sociologists 
having a methodological bent or a theoretical concern with conflict of 
interest, experimental psychologists studying decision making, manage- 
ment scientists interested in theories of “rational” choice and organization, 
philosophers intrigued with the axiomatization of portions of human 
ON  eratiane th professionally practicing decision makers, 
porting. s—those whose work, for the most part, we are 
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ground to the theory of games itself, an si | . 
cuss, we must examine the modern seg y oll 

in risky situations- -utility theory. | a a 
Chapter 3 through 12 the theory of ees ” Z ‘ 
the general model; Chapters 4, 5, anc ; m i. . 
games, and Chapters 7 through 12 the the a : : . 
two players. Chapter 13 turns to the prope 
making when the outcomes are not simply nee ee 
This material, like that in Chapter 14, is included partly Deca aoe 
inherent interest as part of the problem of decision making, but also pay 
because these models are related in various ways to game theory. The 
final chapter, 14, may be described as dealing with problems in group 
decision making, in contrast to all the preceding work, which is devoted 
to the individual in different “environmental” contexts. The eight 
appendices are concerned with more technical topics which arise naturally 
in various parts of the book, but which we chose not to present in the body 
of the book. 

Depending upon his interests and background, the reader may elect 
not to read the chapters in order or not to read all of them. Certain 
plausible groupings come to mind, and these may be worth mentioning. 

i. Chapters 1, 3, 4, 7, and 8 give the general coverage of game theory 
without going into some of the more special and controversial topics. 
Although this does not include utility theory (Chapter 2), it is probably 
an adequate program of reading for a novice who wants some background 


in the subject, but who does not care to go into it deeply or to explore the 
various related topics. 
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ii. Chapters 5 through 12 delve into the conceptually difficult and not 
fully satisfactory theory of general games—those which either have more 
than two players or are not zero-sum or both. The reader already quite 
familiar with two-person zero-sum theory may want to begin with Chapter 
5, although we would also recommend that he read Chapter 3 where the 
basic postulates about the players are introduced and criticized. 

ili. The bulk of the research activity has been on games with only two 
players, provided we include non-zero-sum, infinite, and recursive gam 
as well as the more familiar zero-sum two-person games Saee 
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chapter 1 


(GENERAL INTRODUCTION 
TO THE THEORY OF GAMES 


1.1 CONFLICT OF INTEREST 


In all of man’s written record there has been a preoccupation with con- 
flict of interest; possibly only the topics of God, love, and inner struggle 
have received comparable attention. The scientific study of interest 
conflict, in contrast to its description or its use as a dramatic vehicle, 
comprises a small, but growing, portion of this literature. As a reflection 
of this trend we find today that conflict of interest, both among individuals 
and among institutions, is one of the more dominant concerns of at least 
several of our academic departments: economics, sociology, political 
science, and other areas to a lesser degree. 

It is not difficult to characterize imprecisely the major aspects of the 
problem of interest conflict: An individual is in a situation from which one 
of several possible outcomes will result and with respect to which he has 
certain personal preferences. However, though he may have some con- 
trol over the variables which determine the outcome, he does not have 
full control. Sometimes this is in the hands of several individuals who, 
like him, have preferences among the possible outcomes, but who in gen- 
| eral do not agree in their preferences. In other cases, chance events 

(which are sometimes known in ee as “‘acts of God’’) as well as other 
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entitled their now classical book Theory of Games and Economic Behavio 
presumably to forestall that interpretation, although this does not ie 
size the even wider applicability of the theory. 
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1.2 HISTORICAL BACKGROUNDS 


: The ele mathematical approach to interest conflict—game theory— 
eae to von Neumann in his papers of 1928 and 1931 
” es eel a has raised a question of priority 
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judging by the attention given it in 

in the mathematical ones, thev were not without success his all Only 
a very few scientific volumes as mathemati il as this one have ar yused as 
much interest and general admiration Yet we know that much of the 


material had lain dormant in the literature for two decades. Presumably 
the recent war was an important contributing factor to the later rapid 
development of the theory. During that period considerable activity 
developed in scientific, or at least systematic, approaches to problems 
that had been previously considered the exclusive province of men of 
“experience.” ‘These include such topics as logistics, submarine search, 
air defense, etc. Game theory certainly fits into this trend, and it is one 
of the more sophisticated theoretical structures so far resulting from it. 

Though it is not directly relevant to the theory itself, it is worth empha- 
sizing again that game theory is primarily a product of mathematicians 
and not of scientists from the empirical fields. In large part this results 
from the fact that the theory was originated by a mathematician and was, 
to all intents and purposes, first presented in book form as a highly formal 
(though, for the most part, elementary) structure, thus tending to make it 
accessible as a research vehicle only to mathematicians. Indeed, we 
believe that so far the impact of game theory has been greater in applied 
mathematics, especially in mathematical statistics, than in the empirical 
sciences. 


1.3 AN INFORMAL CHARACTERIZATION OF A GAME 


Game theory does not, and probably no mathematical theory could, 
encompass all the diverse problems which are included in our brief charac- 
terization of conflict of interest. In this introduction we shall try to cite 
the main features of the theory and to present some substantive problems 
included in its framework. The reader will easily fill in examples not now 
in the domain of the theory, and as we discuss our examples we shall 
point out some other important cases which are not covered. 

1 The original edition of Theory of Games and Economic Behavior appeared in 1944, but 
the revised edition of 1947 is the more standard reference and it includes the first state- 


ment of the theory of utility, which we shall discuss in Chapter 2. All of our references 
will be to the 1947 edition. 


the ThEOFY om 


General atroducl! ste outcomes of a ‘ 
lle nd that €a* h ind 
oO them. Thus, i 

: ‘th others and 
z., way or another 
igpe offered any pt 
This problem of it 
ure We shall 
about the assumpti 
hole of Chapter 


hose familiar wi! 


concent 
it ¢ 


of alternatives ‘3 EP eswhto 
no confusion 
devoted the wW 
In order that t 
es of the word “utility” not bested 
tablishing how the modern 


have 


modern utility 


and somewhat discredited, us 


effort in es 


shall expend some . 

i i rent the Tr’ | e " 
differs from the earlier ideas. In brief, the cur! 0 wee 
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to be prescribed, then his preferences can be represented numerically by 
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A theory such as we art discussing 

assumptions about the individual 

We have already stated one: each | 
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inthe game. For example, poker, w hen rn 

with numerical payoffs assigned to each ol the outcomes, and 0 3 O 
play the game is to maximize one’s expected money outcome, But thes ¢ 
are players who enjoy the thrill of bluffing for its own sake, and they bluf 
with little or no regard to the expected payoff. Their utility functions 


cannot be identified with the game money payments. Indeed, there are 
many who feel that the maximization assumption itself is tautological, and 
that the empirical question is simply whether or not a numerical utility 
exists in a given case. Assuming that behavior is correctly described as 
the maximization of utility, it is quite another question how well a person 
knows the functions, i.e., the numerical utilities, the others are trying to 
maximize. Game theory assumes he knows them in full. Put another 
way, each player is assumed to know the preference patterns of the other 
players. 

This, and the kindred assumptions about his ability to perceive the game 
situation, are often subsumed under the phrase “the theory assumes 
rational players.” Though it is not apparent from some writings, the 
term “rational” is far from precise, and it certainly mears different things 
in the different theories that have been developed. Loosely, it seems to 

include any assumption one makes about the players maximizing some- 
thing, and any about complete knowledge on the part of the player in a 
very complex situation, where experience indicates that a human being 
would be far more restricted in his perceptions. ‘The immediate reaction 
of the empiricist tends to be that, since such assumptions are so at variance 
with known fact, there is little point to the theory, except possibly as a 
mathematical exercise. We shall not attempt a refutation so early, 
though we feel we have given some defense in later chapters. Usually 
added to this criticism is the patient query: why does the mathematician 
not use the culled knowledge of human behavior found in psychology and 
sociology when formulating his assumptions? The answer is simply that, 
for the most part, this knowledge is not in a sufficiently precise form to be 
incorporated as assumptions in a mathematical model. Indeed, one 
hopes that the unrealistic assumptions and the resulting theory will lead 


- experiments designed in part to improve the descriptive character of the 
theory. 
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7 of strategy is an abstraction of this ordinary concept 
» et] 


upposed that no ambiguity remains with respect to either 


iia 


the conditions or the actions. With this concept one apparent difficulty 
in applying the game theoretic model to economic problems evaporates. 
The notion of a pure strategy, and some related concepts, will receive 
considerably more discussion in Chapters 3 and 4. 

At least two $c render it difficult in practice to put many eco- 
nomic problems in game form. In general, it is hard to specify precisely 
the strategy sets available to the players. This may stem from a variety 
of causes, but one of the most striking is the possible modification of the 
strategy sets during the execution of the game. For example, this may 
result from a new invention or scientific discovery which opens a whole 
new range of activities to a producer. It is true that such complications 
can be encompassed formally by using the theories of decision making 
under uncertainty, which are discussed in Chapter 13, but this takes one 
outside the realm of games as we have been discussing them. Moreover, 
whether the current resolutions of, say, the invention problem are really 
useful is, at this time, debatable. If we restrict ourselves to just the 
formalism of risk, omitting uncertainty, then such a possibility causes 
trouble and so we may only hope at best to obtain limited predictions. 
This type of limitation seems to be regarded by many social scientists as a 
terrible inadequacy, and yet it is a common difficulty in all of physical 
science. It is analogous to a physical prediction based on boundary 
conditions which may be subject to change during the process, either by 
external causes or through the very process itself. The prediction will 
only be valid to the extent that the conditions are not changed, yet such 
predictions are useful and are used even when it is uncertain whether the 
assumed invariance actually holds. In many ways, social scientists seem 
to want from a mathematical model more comprehensive predictions of 
complex social Phenomena than have ever been possible in applied 
physics and engineering; it is almost certain that their desires will never 
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losing, to which one can assign a more or less arbitrary numerical scale, 
such asiand0. Very often a player aims to maximize his expected gain 
as described by the numerical score of the game; but, as we pointed out 
earlier, there are cases when this score function cannot be identified with 
the person’s utility, such as when an adult purposely loses to a child. 
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cept of a military strategy is common, even if it is not always clearly 
formulated. ‘The second problem is again more profound, and it appears 
to render difficult a game theoretic analysis of many important military 
situations; but certainly some significant ones are subject to the theory. 
One of the simplest is the “duel,” which in an elementary form consists 
of two players 1 and 2 having p and q “shots” respectively. For each 
player i there is a function which gives the probability that a shot fired by 
i at any time ¢ will result in a “hit,” let us suppose a fatal hit. We may 
suppose that the domain of ¢ is limited, as it would be by the fuel supply in 
an air engagement. The problem is then to determine when each player 
best take each of his shots, assuming that he knows how many shots his 
opponent has already taken, so as to maximize the probability that he 
will hit his opponent before being hit. For most duel situations of inter- 
est, the probability of a hit increases with time, as, for example, in the 
classical duel of two men walking towards each other with guns leveled. 

Political controversies are still another fertile source of situations involv- 
ing conflicts of interest. In addition to the difficulties of the economic 
and military problems with respect to ill-defined domains of action, we 
know that here there is considerable ambiguity as to the outcome, or 
payoff, function even over a known domain of possible actions. This is to 
some extent true in the other situations we have described, but it is over- 
whelmingly obvious in the political realm, where, for example, the defeat 
of a candidate has sometimes been attributed (after the fact) to a single 
sentence out of the hundreds he spoke in a campaign. 

A feature suggested by political and economic conflicts is the “social 
arbiter.” Often it is felt that conflicts of interest should not be allowed 
to resolve themselves in, shall we say, the open market of threats and 
counter threats, but that there should exist social devices to take into 
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than the above discussion suggests. This is more appropriately discussed 
in Chapter 6 than here, and we shall confine ourselves to remarking that 
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bodies, etc.; furthermore, when known tools were not applicable n 
mathematics was created. Most other attempts at mathematization 
(with the exception of statistics which plays a special role) have tended to 
take over small fragments of the mathematics created to deal with physical 
problems. If we can judge from physics, the main developments in the 
mathematization of the social sciences will come—as in game theory— 
with the development of new mathematics, or significantly new uses of 
old mathematics, suited to the problem. No one of these theories should 
be expected to be a panacea, but their cumulative effect promises to be 
significant. 

The achievement of von Neumann and Morgenstern is remarkable: In 
the first major publication on the subject, they formulated a clear abstrac- 
tion, drawn from the relatively vague social sciences, having both con- 
siderable breadth and mathematical depth, and they developed an 
elaborate and subtle superstructure with masterful scope. The depth of 
their contribution can be partially appreciated from the fact that today 
the material still must be presented according to their outline; there have 
been additions, true, but the main concepts are unchanged. 
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whether a il or (ii) a group, and accord- 
ing to whether it is effected under conditions of (a) certainty, () risk, or 


uncertainty. To this last classification we really must add (d) a com- 


a 
bination of uncertainty and risk in the light of experimental evidence. 
This is the Ls ovince of statistical inference. 

The di stinction between an individual and a group is not a biological- 
social one but simply a functional one. Any decision maker—a single 
human being or an organization—which can be thought of as having a 
unitary interest motivating its decisions can be treated as an individual 
in the theory. Any collection of such individuals having conflicting 
interests which must be resolved, either in open conflict or by compromise, 
will be considered to be a group. These are not clearly defined formal 
words in the theory; rather they are vague classificatory concepts suggest- 
ing the identifications one might make in applications. Depending upon 
one’s viewpoint, an industrial organization may be considered as an 
individual in conflict with other similar organizations or as a group com- 
posed of competing departments. 

As to the certainty-risk-uncertainty classification, let us suppose that a 
choice must be made between two actions. We shall say that we are in 
the realm of decision making under: 

(a) Certainty if each action is known to lead invariably to a specific out- 
come (the words prospect, stimulus, alternative, etc., are also used). 

(6) Risk if each action leads to one of a set of possible specific outcomes, 
each outcome occurring with a known probability. The probabilities 
are assumed to be known to the decision maker. For example, an action 
might lead to this risky outcome: a reward of $10 if a “fair”? coin comes 
up heads, and a loss of $5 if it comes up tails. Of course, certainty is a 
degenerate case of risk where the probabilities are 0 and 1. 

(9) Uncertainty if either action or both has as its consequence a set of 
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Decision making under certainty is a vast area! ine 
theory in economics, psychology, and management sciences cal are a 
andes this heading. Until quite recently, the mathematical tools used 
were largely the calculus to find maxima and minima of functions and 
the calculus of variations to find functions, production schedules, inven- 
tory schedules, and so on which optimize performance over time—dynamic 
programing, so to speak. We will not discuss these topics, for their con- 
nection with game theory is remote, but we shall sketch some of the ideas 
which led to utility theory. 

Typically, decision making under certainty boils down to this: Given 
a set of possible acts, to choose one (or all) of those which maximize (or 
minimize) some given index. Symbolically, let x be a generic act in a 
given set F of feasible acts and let f(x) be an index associated to (or 
appraising) x; then find those x‘ in F which yield the maximum (or 
minimum) index, i.e., f(x”) 2 f(x) for all x in F. 

Very often the heart of the problem is the appropriate choice of the 
associated index. In many economic contexts profit and loss are suitable 
indices, but in other contexts no such quantities are readily available. 
Consider, for example, a person who wishes to purchase one of several 
paintings. Inasense, we can assert that the essence of the problem is: how 
should the subject select an index function so that his choice reduces to 
finding the alternative with the maximum index? 

Operationally, of course, we can suppress this problem, for all we need 
to do is observe his purchase. Alternatively, we can observe his behavior 
in a host of more restricted situations and from this predict his purchase. 
For example, in an experimental study one might instruct him as follows. 
“Here are ten valuable reproductions. We will present these to you in 
pairs and you will tell us which one of each pair you would prefer to own. 
After you have given your answers to all paired comparisons, we will 
actually choose a pair at random and present you with the choice you have 
previously made. Hence, it is to your advantage to record, as best as 
you are able, your own true tastes.”” Now, it may be possible to account 
for all his choices by assuming that he has a simple ranking of the paintings 
from the least liked to the most liked, such that the subject always ahiotise 
in any paired comparison, the one with the higher ranking. If so, his 


choices can be pithily summarized by assigning numbers to the paintings 
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A “diet” simply prescribes the daily amount of each food to be consumed: 
x, units of Fj, x2 units of F», andsoon. To each diet (x1, Xa) ° = Xn) we 
can associate, i.e., determine, the nutrient yield of any given list of nutrients 
we care to be concerned about, say, iron, calcium, riboflavin, vitamin C, 
etc. We observe that the nutrient yield of the diet Gaia Pa Xn) for 
iron, say, is a linear expression of the form 


ax, + doxe + “+ * + Gnkny 


where a; represents the amount of iron in a unit amount of the food /, 
a, the amount in a unit of F2, and so on. Of course, some of the a; may 
be 0, but none may be negative. The same holds for the x;. Now, medi- 
cal research has established that there are certain minimal requirements 
of the several nutrients, say a units per day of iron. Thus, one wants to 
choose only from among those diets which provide these minimum require- 
ments. So we demand that (x1, xo, °° 
of the form 


- , x) satisfy linear inequalities 


ayx; + aoxg t ++ + + anktn 2 4. 


This is only the one for iron; there is a similar expression for each of the 
other nutrients. Obviously, these constraints can be met by choosing 
sufficiently large quantities of food without regard to cost; however, we 
often wish to choose “‘diets’’ so as to minimize cost—such might be the 
case in a hospital—and that creates a problem of some complexity. If 
pr, po, * * * , Pn are the unit prices of the foods Fi, Fy, - - - , F, respec- 
tively, then the cost of the diet (x1, x2, - * * , xn) is 


piri + poxe + Ar eras + Prin 


* Throughout the book starred sections will be found. It is unnecessary to read 
these to be able to comprehend the rest of the book, and, in some cases fe re - 
more mathematical sophistication to be understood. Within unstarred eye pha 
paragraphs are in small print, and to these the same comment applies ae 
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capitals. What ‘s the shortest route he c . is 
are 47! different feasible acts (not 48! because th i] 
where an act is a directed path which touches 
each act the associated index is the total distanc 
Both of these problems have one aspect which 1 
from the linear-programing problem, namely, the set 
finite. Thus, in a sense, these problems are conceptu ally | 
qa finite number of steps the ‘ndices of all the acts can be cl ecke d 
optimal one chosen. In practice this will not do, not even with 
high speed computers, for n! is a fantastically large number even for 7” 
of moderate size, e.g., 20! = 2,432,902,008, 176,640,000. ce 
One way to solve the personnel assignment problem is to embed it into 


an follor 
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a linear-programing problem. It can be shown that the finite feasible 
set F can be enlarged to an infinite set F*, and that the association of 
indices to each act can be extended to the new set in such a manner that 
the enlarged problem is a linear-programing problem and the solution 
to the enlarged problem is actually an act in F. Thus, the solution to the 
enlarged problem is also a solution to the original problem. (The linear- 
programing formulation for 5 cities is given by Kuhn [1955] and for 7 by 
Norman [1955].) Paradoxically, but very common in mathematics, by 
complicating the problem tremendously we have rendered it more amena- 
ble to analysis. Since the linear-programing problem is in turn related 
to the zero-sum game, we can see in how devious a way game theory can 
enter into the picture. Surprisingly, in this case and others, the induced 
game-theoretic problem has a neat substantive interpretation which can 
aid in making quick intelligent guesses as to approximate solutions of the 
original problem. We shall meet this sort of thing again in Chapter 13 


when we turn to the connections between statistical inference and game 
theory. 


2.4 INDIVIDUAL DECISION MAKING UNDER RISK 


The problems of making decisions under risk first appeared in the 
analysis of a fair gamble, and here again the desire for a utility concept 
arose. Consider a gamble in which one of n outcomes will occur, and let 
the possible outcomes be worth aj, a2, --* , a, dollars, Peace 
Suppose that it is known that the respective probabilities of these aie 
comes are fi, f2, * * * , Pn, where each p; lies between 0 and 1 (inclusive) 


and their sum is 1. How much is it worth to participate in this gamble? 
The monetary expected value is 


b= aipi + aopet+ >>> + ana, 
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which does not sum to any finite number. It follows, then, that o 
should be willing to pay any sum, however large, for the privilege of par- 


ticipating in such a gamble. As a description of behavior, this is sill 
As Bernoulli emphasized, people do not, and will not, behave in accord 
with the monetary expected value of this gamble. 

Bernoulli suggested the following modification of the analysis in ord 
to rescue the principle that people behave according to an expected valu 
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vidual which, in some sense, represents his choices amon; | 
which has as a consequence the fact that the expe ses : sperm 
represents the utility of the corresponding gamble. ie e ee : : oe 
previous examination of utility that this would be a hopeless aim lt we 


considered only a finite number of certain alternatives, but, once we admat 
all possible gambles among a set of alternatives, we are dealing with oe 
infinite set and by its very size there will be many more constraints on the 
utility function. Very roughly, von Neumann and Morgenstern have 
shown the following: If a person is able to express preferences between 
every possible pair of gambles, where the gambles are taken over some 
basic set of alternatives, then one can introduce utility associations to the 
basic alternatives in such a manner that, if the person is guided solely by 
the utility expected value, he is acting in accord with his true tastes—provided 
only that there is an element of consistency in his tastes. For the moment, 
let us ignore the exact statement of this proviso, which is important; we 
will return to it in the next section where we present a formal statement of 
one form of their result. 

There are two points to be emphasized about this result. First, the 
utility function so constructed reflects preferences about the alternatives 
in a certain given situation, and so it will reflect not only how the subject 
feels about the alternatives (prizes, outcomes, or stimuli) in the abstract, 
but how he feels about them in the particular situation. For example, the 
resulting function will incorporate his attitude towards the whole gambling 
situation. Second, the utility associations are introduced in such a man- 
ner as to justify the central role of expected value without any further 
argument, specifically, without any discussion of long run effects. 

The essence of their idea can be illustrated simply. Suppose that our 
subject prefers alternative A to B, B to C, and A to C. Any three num- 
bers a, 6, and ¢ which decrease in magnitude are suitable indices to reflect 
this ordinal preference. But, remember, we are ad 
Suppose we ask his preference between: (i) obtaining B for certain, and 
(ii) a gamble with A or C as the outcome, where the probability that it is 
a is p and the probability that itis Cis1—p. We refer to these as the 

certain option” and the “lottery option.” It seems plausible that if 

p is sufficiently near to 1, so that the outcome of the lottery option is ve 
likely to be A, the lottery will be preferred. But, if p is near 0, then pty 
teed ie ve ee . As p changes continuously Sata 1 to 0 
ry option must change into preference for the 
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words: 

j. Any two alternatives shall be comparable, i.e., given any two, the 
subject will prefer one to the other or he will be indifferent between them. 

ii. Both the preference and indifference relations for lotteries are transi- 
tive, i.e., given any three lotteries A, B, and C, if he prefers A to B and B 
to C, then he prefers A to C; and if he is indifferent between A and B and 
between B and C, then he is indifferent between A and C. 

iii. In case a lottery has as one of its alternatives (prizes) another lottery, 
then the first lottery is decomposable into the more basic alternatives 
through the use of the probability calculus. 

iv. If two lotteries are indifferent to the subject, then they are inter- 
changeable as alternatives in any compound lottery. 

v. If two lotteries involve the same two alternatives, then the one in 
which the more preferred alternative has a higher probability of occurring 
is itself preferred. 

vi. If A is preferred to B and B to C, then there exists a lottery involving 
A and C (with appropriate probabilities) which is indifferent to B. 


2.5. AN AXIOMATIC TREATMENT OF UTILITY 


The purpose of this section is to make precise both the consistency 
requirements and the theorem which we discussed informally in the last 
section. We shall adopt a set of axioms which are a bit different from 
those already available in the literature. At some, but relatively unim- 
portant, expense in generality, we can employ axioms which are extremely 
simple and which lead to the utility numbers quite directly. For other 
axiom systems the reader is referred to von Neumann and Morgenste 
[1947], Herstein and Milnor [1953], and Hausner [1954] ‘ ie 
As we present these axioms, it is well to have some inter 
in mind. We suggest the following: Su 
choice between a pair of lotteries which ar 
risky alternatives. Because of their com 
cult to decide which one is preferable. 


pretation of them 
ppose that one has to make a 
€ each composed of complicated 
plexity it may be extremely diffi- 
A natural procedure, then, is to 


ing it into sim] 
these alterna 
A simple 
ih h relate the sit p 
., consistent P< 
Our 


ment to certal Imp 
hat utility numb ‘oducey 


| n rule: 
| . 
| ones + 4 commit 


: Jus 
alternatives, cs, in the sense t 


| Bia sum a ag we introduce each assump' axiom), 

| | : it will restrict the cability of « 
| | lly to see just how it npI A be J 
de] must, inevitably, be a compr + between wid 
hrough less restrictive assumptions and riche 


| and wid hematical representation through stronge 


| and more elegant mat 


tions. aa =e | 
Beste is little practical loss of generality if we suppose that all lotter;: 


are built up from a finite set of basic alternatives or prizes, which y 
| denote by 4;, 42, ° °°; A,. A lottery ticket is a chance mechaniy, 
. Which yields the prizes A;, A, - - - , A, as outcomes with certain know, 
Wh | probabilities. If the probabilities are p1, f2, - - - , p,, where each ; ) 
| and the sum is 1, then the corresponding lottery is denoted by (py 
ie p242, ~~~, ,A,). We interpret this expression to mean only this: ont 
| | and only one prize will be won and the probability that it will be 4; ish 
| : | | nal one can think of a lottery as the following experiment: ! 

| § unit circumference is subdivided into arcs of lengths pi 


es en aaa ; om 
os of le ee and “fair” pointer is spun which if it comes to rest in th 
‘| hgth p; means that prj 


\ 

efinit . 
ia ng Htely assuming that there is no conc l di . ‘oning obi 
i) | Probabilities to the ev i ee 0 Y wasignits” 


= 


Sears 
S = 


to ad 


. 7 wilt 
_, Admit a fre ; That is to say, we are 4 
Probabilities dvency interpretation ; 
en 


of probability when assigo” 
Point of 4 o ee however, view the lottery a 
ROt Something 4.1 ungle entity that will be contu 
aie be Tepeated many times. This rest 
© Conceptual pry iY’ Probabilities will permit us 

Problems of game theory. Those proble 


aVin § ili 
ee ouch abstruse events that the proba 1, 


e Uncle > : ter 
Ncerneg wi a will be deferred until on if 
0 individual’s choice betwee? 


to th 


1 


ESS" i} }.}» 


] An Axiomatic 
2.5 

of lottery tickets L= (piA1, p22 ae 
spe br A.) ifivdec preferred to L’, this m« 

; : - 1 ] me 
ated with L to that as 


the experiment assocl 
the symbolism 


Among the basic prizes, We use su 
A; is not preferred to A; Equivalently, 


indifferent to Aj. 


Assumption 1 (ordering of alternatives). * 
ence? ordering, 7, holds between any two prizes, and 1 
for any A; and A, either A; *¢ A; or Aj > Aj and if Ay [Aj Na AG ~ 


A; e Ax. 


These assumptions can be criticized on the grounds that they do not 
correspond to manifest behavior when people are presented with a 
sequence of paired comparisons. This can flappen even over time 

) periods when it is reasonable to suppose individual tastes remain sta- 
tionary. There are several possible rationalizations for such intransi- 
) tivities. For one, people have only vague likes and dislikes and they 
make “mistakes” in reporting them. Often when one is made aware of 
intransitivities of this kind he is willing to admit inconsistency and to 
realign his responses to yield a transitive ordering. See Savage [1954, 
pp. 100-104] for a penetrating discussion of an example due to Allais 
which traps people, including Savage, into inconsistencies. Once the 
inconsistency is pointed out, Savage claims that he is grateful to the theory 
for indicating his inconsistency and he promptly reappraises his 
evaluations. 

A second rationalization asserts that intransitivities often occur when a 
subject forces choices between inherently incomparable alternatives. 
The idea is that each alternative invokes ‘‘responses”’ on several different 
“attribute” scales and that, although each scale itself may be transitive, 
their amalgamation need not be. This is the sort of thing which psycholo- 
gists cryptically summarize by terming it a multidimensional phenomenon. 

No matter how intransitivities arise, we must recognize that they exist, 
and we can take only little comfort in the thought that they are an 
erie aoe Ba =e theory in the behavioral sciences 
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by Harold Kuhn. Throughout that city ar¢ 
prizes tickets in the National Lottery.) 


Assumption 3 (continuity). Each prize A, fi 
ticket involving just A; and A,. That is to say; there | i 7 
that A; is indifferent to [ujAi, 0A2, °° * » VAr—ay VE MUA Th rt y 
venience, we write A; ~ [uwA1, (1 — u;)A;| = Ai, but note welt that A; and 
A; are two quite different entities. 


This is a continuity assumption. If 4; > A; ~ A,, 118 plausible 
[pAi, (1 — p)A,] is preferred to A; if p is near 1, and that the prefe 
inverted if p is near 0, so it is also plausible that as p is shiftec 
there is a point of inversion when the two are indifferent. 

Although this assumption seems plausible, at least as a criterion of con- 
sistency, there are examples where it does not seem universally applicable. 
It is safe to suppose that most people prefer $1 to $0.01 and that to death. 
Would, however, one be indifferent between one cent and a lottery, 
involving $1 and death, that puts any positive probability on death? 
When put in such bald form, some, whom we would hesitate to charge 
with being “irrational,” will say No. At the same time, there are others 
who would argue that the lottery is preferable provided that the chance 
of death is as low as, say, one in 10!9°° for such an event is a virtual 
impossibility. Even though the universality of the assumption is suspect, 
two thoughts are consoling. First, in few applications are such extreme 
alternatives as death present. Second, even if assumption 3 is neither 
explicitly assumed nor a consequence of other assumptions, a utility calcu- 
lus can be derived. A single number will no longer suffice; rather, an 
n-tuple is needed; nonetheless, a good deal of game theory can be con- 
structed on this more complicated utility foundation. We will not 


describe this theory of n-dimensional utilities; the interested reader can 
consult Hausner [1954]. 


eed 


Assumption 4 (substitutibility). In any lottery L, A; is substitutable for Aj, 
that 1S, (pid ope. |. piAi, tase p,A;) Ping (piA 1 eS ener) bids, eee he 
pry). 


This assumption, taken with the third, is reminiscent of what is known 
in other work as the assumption of the independence of irrelevant alternatives: 
this we shall discuss in Chapter 6 and again in Chapters 13 and 14 If 
one asserts A; ~ A;, then in view of assumption 4 we also assert that not 
only are they indifferent when considered alone but also when substituted 


in any lottery ticket. Thus, the other possible alternatives must be 
irrelevant to the decision that they are indifferent 
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With these six assumptions we are done, for if two lotteries L anc 
given, the first five assumptions permit us to reduce them to the form of 
lotteries in assumption 6, and then we decide between them on the basis 


ci < 
( ) 


of assumption 6. That is, for lotteries L = ($141, ° °°, prd,r) and 
L’ = (p1'Ai, * + + , pr'A,), we compute 
pitr t+ poets ++ + fur and = py’uy + po'ug + ++ + + py'ur, 


and if the former is larger we prefer L to L’, if the latter L’ to L, and if they 
are equal L and L’ are indifferent. Put as a formal theorem: 


If the preference or indifference relation ~ satisfies assumptions 1 through 6, 
there are numbers u; associated with the basic prizes A; such that for two lotteries 
L and L’ the magnitudes of the expected values 


Pili + pote t+ + prt, and = py'uy + polug ts - meee 


reflect the preference between the lotteries. 


Let us introduce the following terms which will be used in the rest of the 
book. Ifa person imposes a transitive preference relation > over a set of 
lotteries and if to each lottery L there is assigned a number u(L) such that 
the magnitudes of the numbers reflect the preferences, i.e., 4(L) > u(L’) if 
and only if L > L’, then we say there exists a utility function u over the 
lotteries. If, in addition, the utility function has the property that 
ulgl, (1 — g)L’] = qu(L) + (1 — g)u(L’), for all probabilities g and 
lotteries L and L’, then we say the utility function is Jinear.2 The above 


‘ ; , 4 
Sometimes this property is referred to as the expected utility hypothesis since it asserts 
that the utility of a lottery is equal to the expected utilit 


tility I t y of its component rizes. 
pes only is this terminology more explicit (if less brief), but it would help shai: 
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A similar calculation shows that 


Thus, if this person is to be consistent wi" 1 O 


same time have the stated indifferences Detwe 
2 ru ; 
Az; and A3, then’he must prefer LT tok. 


Two possible linear utility indicators are given 1n the folowing 


Lottery| Ai | Ag A3 Ag | (p1A1, p2Ae p4A4) 
u 1.0 0.6 Bee 0.0| pi(1) + p»(0.6) = p3(0.2) + p4(0) 
oT 2e | O.8 | 0.0 | —0.4| pi(1.6) + p2(0.8) + p3(0) + pa(—0.4) 


The first of these is the one described above, and the second one is the 
linear transformation of it obtained by using the constants a = 2 and 
b = —0.4. 

Given that a subject’s preferences can be represented by a linear utility 
function, then he behaves as if he were a maximizer of expected values of utility. 
It is important to recognize that a subject’s manifest behavior may be 
summarized by a linear utility function without his being consciously 
aware of making his choices in this manner. About his subconscious 
awareness we will not comment. 

The general theory of utility is not confined to a finite set of basic alter- 
natives nor to cases where a least or a most preferred alternative exists. 
We have only examined a simple special case, but one with sufficient 
complexity so that we can see just what is involved when we use utility 
theory in game theory. If one is interested in the more general theories, 


which are correspondingly more complicated, see the papers referred to 
at the beginning of this section. 


2.6 SOME COMMON FALLACIES 


Newcomers to modern utility theory tend to be critical of the idea, and 
. ele . ? 
to be sure, there are valid reasons, but as criticisms are so often based on a 


fallacious understanding of the construct we have elected to point out some 
of the more common misinterpretations. 


Fallacy 1. (f,A3, - >." .. typAz). 38 preferred to Cy are » fr A,) 


because the utility of the former, piu; + + + + + pup, et 
r, 0S larger th 
the latter, py’uy + +++ + Pr!tp. pr ger than the utility of 


Some care must be taken to see why this is a fallacy, for there are two 
quite distinct ways of interpreting utility theory. First, we may think 
of the theory as a description of preference, in which case the causal 
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Fallacy 2. Suppose that A >B>C>D and ‘ol the utilities of the 

alternatives satisfy u(A) + u(D) = u(B) + u(C), then (6B, 14C) should h 

preferred to (14A, Y4D) because, although they have the same expected utility, th 
former has the smaller utility variance. 


This is a completely wrong interpretation of the utility notion, and 
again it results from a failure to accept that preferences precede utilities 
It misses the point of utility theory. The principal result of utility theory 
- os is that a linear utility index can be defined which reflects com- 
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Or 


9.7 INTERPERSONAL COMPARISONS 
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There is one thing which we stress 
attempts to devise a numerical utility for decis! 
which we have not adequately discussed for u 
is risk: the uniqueness of the function. Under cert 
difficulties was the almost complete lack of uniqueness—an ae 
serving transformation of the numbers was equally acceptable. It is also 
true in the risky situations that any order-preserving transformation of a 
utility function is again a utility function, but such a transformation of a 
linear utility function does not generally result in a linear utility function. 
One must, therefore, keep in mind the class of transformations which take 
a linear utility function into one of the same type. As we pointed out 
before, the appropriate class consists of those transformations known as 


the positive linear ones, i.e., if u is a linear utility function over a set of 


risky alternatives and if a and 6 are any constants so long as a is positive, 
then uv’ = au + b is again a linear utility function over the set. Con- 
versely, if u and uw’ are two linear utility functions for a preference relation 
over the same set of alternatives, then there exist constants a and b, where 
a is positive, such that u’ = au + 6. 

Another way of stating this uniqueness result is that the consistency 
axioms (such as assumptions 1 through 6 of section 2.5) determine a 
linear utility function which is unique up to its zero point and its unit. 
If we choose any two alternatives which are not indifferent, then we can 
always set the utility of the less preferred to be zero and the utility of the 
more preferred to be one. As we shall come to see, the non-uniqueness of 
the zero point is of no real concern in any of the applications of utility 
theory, but the arbitrary unit of measurement gives trouble. The trouble 
may be illustrated most easily by a fictitious example in the measurement 
of distances. Suppose two people are isolated from each other and each 
is given a measuring stick marked off in certain and possibly different 
arbitrary units. The one subject is given full-scale plans for an object 
to be constructed to the same size by the other, and he is permitted to send 
only messages stating angles and lengths (in his units) of the object 
With such limited communication it is clearly possible for the second — 
to construct a scale model of the object, but it will only be of the corr 
size if it happens that both measuring rods are marked in the sam neh 
Clearly, once the barriers on communication are d d Wi 
can determine with fair accuracy the relationshi fie ee ee 
ie Saeeeeriho HE h sonariD between their two units 
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for it is correspondingly more intense. wh 
we do not know, but it seems to mean something to each of us. We, 


forever trying to decide whether one outcome means more or - 
another person than a different outcome means to us. For more disc 
| sion of these matters in the context of game theory, where they play hay 

| see Chapters 6, 7, 8, and 14. 

Thus, the fact that a linear utility function is defined only up to a line 
transformati : ice 
i one leads to the problem of interpersonal comparisons 
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comparisons, so one way OF another the ver) 
upon these. One procedure is to determine 
tives experimentally, using the assumption 

that it is linear. Suppose we arbitrarily ass! 
alternatives and then we determine the utility 
lottery of the first two which is indifferent to it. 
C ~ [pA, (1 — p)B]; then by linearity u(C) = u[p4, G 
(1 — p)u(B) = p. If this is done for several more 


knows enough values on the utility scale (assuming 


po 
it 


dictions. For example, suppose it was found that u(D) = q. inhen we 
could predict whether the lottery [rA, (1 — r)C] is preferred or not to the 
lottery [sD, (1 — s)B] for a particular choice ofr ands. If these predic- 


tions are confirmed experimentally, we then have some confidence that 
we have obtained a portion of the utility function. This is the method 
which was used by Mosteller and Nogee [1951]. 

A more elegant, if more difficult, alternative to starting with a model 
having an infinity of comparisons is to devise one in which only a finite 
number are to be made. Such a model must be quite different from the 
one we have described if it is to lead to a linear utility function which is 
unique up to a linear transformation. However, such axiom systems are 
possible as has been shown by Suppes and Winet [1955] and for this case 
Davidson, Siegel, and Suppes [1955] have devised an experimental setup 
:n which it can be checked. This work is probably the most experimen- 
tally elegant in the area, and the results have been very encouraging. 

A second difficulty in attempting to ascertain a utility function is the 
fact that the reported preferences almost never satisfy the axioms, €.g., 
there are usually intransitivities. Furthermore, if the same pair is offered 
several times, then in some cases the subject will not be consistent in his 
reports. One cannot expect the data to fit the model perfectly, but how 
does one determine which model they fit most closely and how does one 
measure how good the agreement is? Such problems pose the following 
intriguing and important statistical problem: to formulate a model which 
assumes that a subject is actually (or latently or genotypically) a von 
Neumann-Morgenstern utilitist in the sense of “having” a linear utility 
SAS but that his responses yield this underlying order confounded 
ey oe trav cinpimaecneannn on 
matic approach that such a postulate srikcinse a ne ae hae aS 
mean by a “‘reasonable,”’ or ‘‘approximate,”’ a Ag * emi 

- e,” or “‘realistic’’ fit of the data 
to theory. To date, little has been published on this problem. 
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why, then, make any measurements 10 the la! ‘ ral 
is to see if under any conditions, however ! 

model can be confirmed and, if not, to see how 
accord better at least. with those cases. 


postulate the general existence of these new CO! 


T+ varil 


feels less cavalier if he knows that there are two or 
postulates have actually been verified. | ; 
Every indication now is that the utility model, ana possibly theretore 
the game model, will have to be made more complicated if experimental 
data are to be handled adequately. Although one such complication 
of the utility model is discussed in Appendix 1, its domain of applicability 
‘s limited and it is completely unclear how it can be utilized in game 
theory. Furthermore, neither it nor any of the present utility models 
take into account the intuition, now bolstered by a staggering amount of 
empirical data for a wide variety of psychological dimensions (see Miller 
[1956] for a partial survey), that people rarely categorize a single dimen- 
sion into more than seven or so distinct levels. The major exceptions 
seem to be cases where the culture provides a simple, fine, and unam- 
bigious scale, such as money. Since, however, most decisions, even when 
money is a factor, are not based entirely on monetary considerations, dis- 
crete categorization of preferences may be the basic case to study. As 
no theoretical work has been carried out on such problems, we can only 
turn in the following chapters to what has been done to give a model for 
conflict of interest within the present utility framework. 


2.9 SUMMARY 


The primary purpose of this chapter was to introduce the central ideas 
of modern utility theory, which is a cornerstone of much decision theory. 
As background, we classified decision making according to whether a 
decision is reached by an individual or a group, and whether it is effected 
under conditions of certainty, risk, uncertainty, or a mixture of uncer- 


tainty and risk in the light of experimental evidence. Using these cate- 


| gories, we described the general structure of the book. 

{ Decision making under certainty encompasses much of formal theor 
1 in social science. This problem can be viewed as follows: given a set of 
/ ) ee acts, to choose those which maximize (or minimize) a given index 
. n many traditional applicatio i ; 
: one of the Vee but sane ee > - Hehe voce = 
r | nteresting modern prob- 


lems require more sophisticated techniques. Although we did not go into 
decision making under certainty, the example of linear programing was 


discussed briefly because of its close relation to game theory and its inh 
ent importance in applications. a 
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EXTENSIVE AND 
NORMAL FORMS 


3.1 GAME TREES 


The mathematical abstraction of a game assumes the three forms we 
mentioned in the introduction: the extensive, the normal, and the char- 
acteristic function forms. ‘The first is our topic throughout most of this 
chapter; it is an attempt to capture the salient features of certain conflicts 
of interest, such as those found in a parlor game. The rules of any parlor 
game specify a series of well-defined moves, where each move is a point of 
decision for a given player from among a set of alternatives. The par- 
ticular alternative chosen by a player at a given decision point we shall 
call the choice, whereas the totality of choices available to him at the 
decision point constitutes the move. A sequence of choices, one following 
another until the game is terminated, is called a play. These familiar 
words, particularly the word “move,” are being used in somewhat unfa- 
miliar ways. For example, in chess and other board games the word 
move is used in two ways which differ from our meaning. _ It is sometimes 
used to refer to the physical act of moving a piece from one position to 
another. It is also used in such phrases as “‘the fifth move.” This refers 
to the set of all possible moves in our sense following any of the possible 
sequences of four choices from the beginning of the game. 
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game, there must be associated a p 
over the several alternative choices. 

of a fair coin, then there are two alte1 
occur with probability 4. 


A drawing such as Fig. 2, 


when consiaere 

matical system, is known as a connected grap 

of a collection of points (called nodes or verti 

certain pairs of nodes such that a path can b¢ traced out from each point 
to every other point. A graph may have closed loops of branches, such 


as abca or abdeca in Fig. 3. A connected graph with no such loops of 
branches is called a tree. The graph of a game is a tree, which is called 
the game tree. It may not seem reasonable to assume the eraph of a game 
is a tree, for in such games as chess the same 
arrangement of pieces on the board can be 
arrived at by several different routes, which 
appears to mean that closed loops of branches 
can exist. However, in game theory we choose 
to consider two moves as different if they have 
different past histories, even if they have exactly 
the same possible future moves and outcomes. 
In games like chess this distinction is not really 
important and to make it appears arbitrary, but in many ways the whole 
conceptualization and analysis of games is simplified if it is made. The 
tree character of a game is not unrelated to the sinking feeling one often 
has after making a stupid choice in a game, for, in a sense, each choice is 
irretrievable, and once it is made there are parts of the total game tree 
which can never again be attained. 

The tree is assumed to be finite in the sense that a finite number of 
nodes, and hence branches, is involved. This is the same as saying that 
there is some finite integer N such that every possible play of the game 
terminates in no more than WN steps. Such is certainly true of all parlor 
games, for there is always a “‘stop”’ rule, as in chess, to terminate stale- 
mates. ‘To say the tree is finite is not to say that it is small and easy to 


work with. For example, card games often begin with the shuffling of a 


deck of 52 cards, and so the first 0 move has 52!, ice., approximately 
8.07 X 10°, branches stemming from it. Clearly, for such games no one 


is going to draw the game tree in full detail! 


Fig?3 


3.2 INFORMATION SETS 


The next step in the formalization of the rules of a game is to indicate 
what each player can know when he makes a choice at any move. We 
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the diagram, the rules of the game make It 1mpo: 


whether he is choosing between / and ¢ OI \ 


In general, the rules of any game must speci!) 
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moves—which are known as information sets. ach of tne Moves Ith NIM 
j wanhlat the \oves must have 

set must be assigned to the same player, and each of the moves m\ ha 


exactly the same number of alternatives. For if one move has r alterna- 
tives and another s, where s ~ r, then the player would need only count 
the number of alternatives he actually has in order to eliminate the possi- 
bility of being at one move or at the other. A third condition, which may 
be less obvious, is also assumed, namely: a single information set shall not 
contain two different moves of the same play of the game tree. The reason 
for this condition is that it seems to be generally met in practice, and hav- 
ing it makes the theory simpler. 

Returning to Fig. 4, consider player 1’s information set which has two 
moves. Since they are indistinguishable, each choice on one move must 
have a corresponding choice on the other move. It is convenient in these 
diagrams to pair them systematically, so f’ corresponds to f’’, g’ to g’’, and 
h’ toh”. It is clear that this correspondence can be generalized to infor- 
mation sets having more than two moves and other than three alternatives 
at each move. 

When an information set consists of a single move, the player is totally 
informed in that he knows exactly where he is on the tree. When all the 
moves are of this type, we say that the game has perfect information. 


Ticktacktoe and chess are examples of games whose rules result in perfect 
information. 


3.3, OUTCOMES 


The final ingredient given by the rules of the game is the outcome 
which occurs at the end of each play of the game. Almost anything may 
be found to be the outcome of some game; for example, the subjective 
reward of victory in a friendly game, or the monetary punishment of 
seeing someone else sweep in the pot, or death in Russian roulette. | 
any given system of rules for a game there is some fixed set of outco 
from which specific ones are selected by each of the plays. Each of 
end points of the game tree is a possible termination point 
and it completely characterizes the play of the game whic 


point, for there is only one sequence of choices in a tree lead 
end point from a fixed first move. 
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captured, and the second spade 1 
the only difference being that each hand } 

for the first spade. ‘The process cont 

until one of the players has spades total 

tions are possible which from the point 

different, but, given the fallibility of 

differently. Either the cards play ed fron 
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game seems to be more taut with the latter procedure, 
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To get an idea of the interest of the game, suppose the first card t 
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_ the other hand, if he plays a 7 he has won a lot at little cost to his ee 
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player 4 who must select one of two cards, 
player 2 who in the absence of knowledge of player 
one out of two cards. Naturally, the outcome ‘ f 
pletely determined by the previous moves. The | 
and the victor is the player having the largest sum ! 
Since the sum is 6, there is the possibility of draw as well as win « 

The numbers alongside the branches denote the card chosen, ane W on = 


end point means player 1 wins, D means a draw, and £ that he loses. 1! 


° } ta far eA h nlax 
other words, the set of outcomes Consists of three elements for each piaye 


W, D, and L. 


> It should be noted that for a given set of rules there is not, in general, a unique 
tree representing them. For example, it is clearly immaterial whether we put 
player 1’s move before or after player 2’s move on the tree, for the information 
sets are so chosen to make them simultaneous in effect. A more profound change 
in the tree can be effected by replacing the single chance move by two chance 
moves. We can either view chance as selecting with probability 14 one of the six 
permutations of the three cards, or, as is shown in Fig. 6, we may view the first 
chance move as selecting one of three cards with probability 14, and after players 
1 and 2 have each taken a move chance selects one of the two remaining cards with 
probability 14. The two representations are equivalent to the rules of the game. 
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3.5 EXTENSIVE FORM 


So far we have characterized what we shall mean by the rules of the 
game, but no players have really been introduced. It is true that we 
have spoken of players who will make choices at the moves, but they 
cannot be considered to be introduced until they are characterized by the 
properties which describe their behavior. It is clear that we may have 
the same set of rules but very different conflict of interest situations— 
games—depending upon the nature of the players. In total we shall 
make three assumptions about our players; these assumptions will char- 
acterize them. 

The first assumption is that each player has a pattern of preferences over 
the set of outcomes which satisfies the axioms of utility theory. It will be 
recalled that we established in Chapter 2 that if this is the case then we 


can assign a numerical and linear utility function to the outcomes. Mor 
formally, we suppose: . 


vii. For each player i, there is a numerical and li li } 
inear utility funct 
over the set of outcomes Q. halenP pee 
It is easy to see that we can combine conditions (vi) and (vii) into a 
single condition, for by (vi) an outcome is assigned to each play and b 
(vii) a utility to each outcome, hence a utility, which we may denote es 
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The system consisting of the first five parts of the rules of the gang and 
(vii’) is known as a game in extensive form. We observe that it 1s not 1dent- 
cal to the rules of the game for it includes the preference patter the 
individuals participating in the game, and so introducing different indi- 
viduals into a situation having the same rules will, in general, yield a dit- 
ferent extensive form. 
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tions on the extensive form necessary to © 
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finite number of moves. 


3.6 RATIONALITY AND KNOWLEDGE 


As we said in the last section, the players in the game are to be charac- 
terized by three assumptions, only one of which has so far been given, 
namely: each player has preferences over the outcomes which meet the 
axioms of utility theory. The other two assumptions concern what the 
players know and the basis on which they arrive at decisions. 

Let us take up the question of knowledge first. It is assumed that: 


viii. Each player is fully cognizant of the game in extensive form, 1.e., he is fully 
aware of the rules of the game and the utility functions of each of the players. 


It hardly seems necessary to point out that this is a serious idealization 
which only rarely is met in actual situations. If one systematically 
examines the several features of a game in extensive form, it is clear that 
human beings do not generally have the knowledge assumed. 

Is there, then, any reason to pursue the theory of games further except 
as a mathematical topic; can it possibly have any relevance to social 
science? The answer Yes can be supported by three distinct reasons. 
First, the theory can be used normatively to tell a person that this is the 
knowledge he should acquire, and, once he has it, the theory establishes 
the decisions he should make in order to achieve certain specified ends. 
Second, the knowledge assumption does not seem nearly so grave when we 
move to the next level of abstraction—the normal form. Since most of 
the theory is actually at that level, we should not yet abandon it. Third 
it may be possible to weaken the knowledge assumptions of the elagcsiess 
game model. In section 12.4 we shall briefly examine a model where 
each of the players is supposed to have “‘misperceptions”’ concerning the 


utility functions of the other players; in that model each player has only 


partial knowledge of the entire postulated structure. We do not want to 
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The logical quality ulate . 
shall take it to be entirely tautological in character in the sense that th 


postulate does not describe behavior but it describes the word “prefer. 
ence.” With this interpretation, the problem is not to attempt to verif 
the postulate but rather to devise suitable empirical techniques to deter 
mine individual preferences. It is not the least bit obvious that the 
“preference”’ implicitly defined in (ix) will be the same as the ‘“‘preference’ 
defined by an arbitrarily given set of experimental Operations, such a 
asking a person to state his preferences amon 
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3.7 PURE STRATEGIES AND THE NORMAL FOR! 
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One way to ascertain the outcome of a game 1n exten ive form is to let 
the players play it and observe the outcome. Indeed, many would say 
this is the only way, but they would be wrong, for in principle we could 
cause each player to state in advance what he would do in each situation 
which might arise in the play of the game. From this information for 
each of the players, an umpire could carry out the play of the game without 
further aid from the players and thereby determine the payoffs. Such a 
prescription of decision for each possible situation is known as a pure 
strategy for a player. 

For many games the actual preparation of a pure strategy in a form an 
umpire could use without ambiguity is a hopeless task; however, certain 
simple examples of pure strategies are easily given, though in general they 
would be poor ways to play. For example, if we suppose that each branch 
stemming from a move is given a number, 1, 2, °°: , 7, where r is the 
number of branches, then one pure strategy is to take branch 1 at each 
move. Another is to take the branch with the largest number. Indeed, 
if player 7 has q different information sets, which we may number 1, 
2, °° °,q, then any pure strategy can be represented by a set of g num- 
bers, where the jth number represents the branch chosen when, and if, the 
play reaches the jth information set. Thus q-tuples of integers, 


(1, 925 a ia me 


represent pure strategies. For example, the strategy in which ‘branch 1 
is always taken is represented by 


eae, ee 


But each y; has as its range only a finite number of integers, since each 
move has only a finite number of branches, and there are only a finite 
number of y’s, namely g, so there are only a finite number of strategies 
Without any loss, we may label the strategies by numbers 1, 2, - - - t 
where ¢ is the total number of strategies available to the —— The 
number ¢ is finite, but it need not be small. A game having but 10 infor- 
mation sets for a player and 10 branches at each set is exceedingly simple 
but it can have millions of essentially different strategies for the spe 
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| | Let 5; be the label for player :’s typical strategy choice, and 5S; be th 
1] | label for the set of all his possible strategy choices. Now, as we pointed 
| out, when each player has selected a Strategy, an umpire is in a position 
) | } to play the game and to determine the payoffs. That is to say, from the 
ae ae of the game in extensive form we may determine a payol 
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We are justified in forming this sum |! 
comes are measured in a linear util 
utilities M@;(a) and M,(8), then the utilit 
a or B arises, the former with probability p an 
1 — p, is equal to pM(a) + A — p)MAB 

Observe that by means of the strategy notion every § 


form has been reduced to a game of the following torm: €a nh player tae 


exactly one move (a choice among his several strategies), and ne 


} 1. 3 are 
nd he makes nis 


choice in the absence of any certain knowledge about the choices of the 
other players. The payoff to the players is determined from the func- 
tions M; and the values of 5;. This is a reduction of every game to a 
simple standard form which is called the normal form of a game. 

What sleight of hand is this?’ We began by abstracting parlor games 
and arrived at the extensive form of a game, which, in general, led to an 
oppressively complex game tree. For games of any reasonable com- 
plexity, the number of possible trees and of variations arising from different 
information sets immediately led us to believe that there is little hope of 
finding detailed classifications of games in extensive form or of analyzing 
player behavior at that level. Then, by introducing the idea of a pure 
strategy, we suddenly reduce all games to a comparatively simple standard 
form. Thus, the sleight of hand is to trade the conceptual complexity 
of a game tree for the numerical tedium of listing all available strategies. 

The reduction of any specific game, except the simplest, to normal form 
is a task defying the patience of man; but, since the normal form of all 
possible games is comparatively simple, one may hope to carry out suc- 
cessfully a mathematical examination of all possible games in normal form. 
The study of specific games may be close to impossible, but it may now be 
quite feasible to classify, analyze, and determine the features of all games. 
For some empirical purposes that may be sufficient. 

We see that the normal form of the game is exactly the general problem 
which was evolved and discussed in the introduction to the book: each 
player has limited control over the variables which determine what he 
shall receive, and each wishes to maximize his return. 


3.8 SUMMARY 


More than half of this chapter was taken up with a detailed description 
of the concepts required to present a conflict of interest in extended form 
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which is a radical conceptual simplification of the extensive form 

achieved by introducing the idea of a pure strategy: a detailed prescription 

of what a player shall do in each eventuality. Given this notion, a game 


¢ 


in normal form consists of: 


i. The set of n players. 

ii. _n sets of pure strategies S,;, one for each player. 

iii. n linear payoff functions M,, one for each player, whose values 
depend upon the strategy choices of all the players. 


As in the extensive form, part of the description of the players is con- 
tained in the payoff functions and the rest is given by a knowledge assump- 
tion (now of the game in normal form) and a rationality assumption. 
Each player attempts to maximize his utility in a situation where his 
outcome depends not only upon his choice, but upon the choices of each 
of the other players; in turn, their choices are influenced by the choice 
they think he is going to make, for they too are attempting to maximize a 
function over which they do not have full control. 
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fixed-point theorem in topology an th 
bodies. ‘Third, the economic theories 
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games. Fourth, Wald’s contributions to th 
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ultimately the theory of games nll be c 
because of its historically significant re 
than for its own sake. Finally, one lc 
matical elegance of the two-person theory has served 
ticians to the study of mathematical applic tion 
Thus, even if the final verdict should be that game theory has nothing su 
to contribute to the social sciences—this we do not believe will be the ¢ 
assuming it so—the by-products of the two-person theory would still be significant 
for the social sciences. 4 


All that has been said about the role of two-person theory is, in fact, 
about a special class of two-person games: those which in normalized 
form have a finite number of pure strategies and have a special property 
which later we shall call ‘‘zero-sum.” This chapter is devoted to this 
class of two-person games. 

To find what we mean by a two-person game in normalized form, the 
reader need only specialize the general definition of section 3.7 to! tne 
case n = 2. Thus, we have two players 1 and 2 and a game in which 
each player has only one move—the moves are taken simultaneously or, 
what amounts to the same thing, they are taken in succession in such a 
manner that the player who moves second does not know the choice made 
by the other player. We may denote the two sets of pure strategies as 
follows: 


Sj =A= (a1, QO ar ital 
S's = B= {B1, Bo, Pty stint: 


The assumption that the game has a finite number of pure strategies is 
built into the notation we have used, for the number m denotes the number 
of strategies available to player 1 and n the number available to player 2. 

To a strategy choice for each of the players, a; for player 1 and 8; for 
player 2, there is a certain outcome which will be denoted O;;.. An shite 
come O;; can take on a wide variety of possible interpretations, such as: 


i. Player 1 pays player 2 $10. 

ii. Player 2 “‘wins’ the game. 

ili. Player 1 is killed and player 2 is maimed. 

iv. Player 1 can choose any book he wants from player 2’s librar 
v. Player 2 receives player 1’s record collection. 3 
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Player 1, through his choice of an alternative from A, attempts to con- 
trol the outcome according to his tastes, and player 2 similarly strives for 
an outcome he desires by his choice from B. The problem, then, is: 
under the given conditions, what choices should the players make? Or alterna- 
tively, if you were player 1, what should you do in order to achieve your 
desires? The study of two-person games amounts to formulating possible 
meanings for these questions and giving answers to them. Our first step 
toward formulating the question is to limit, for the time, the class of games 
to which it applies—to limit it to those games where the one player has a 
preference pattern exactly opposite to that of his opponent. 


4.2 STRICTLY COMPETITIVE AND NON-STRICTLY 
COMPETITIVE GAMES 


The analysis of some games is trivial and, presumably, inherently dif- 
ferent from non-trivial ones. For example, if both players have the same 
preference pattern over the outcomes, then everything is trivial since both 
players prefer the same outcome above all others. There is no conflict of 
interest. It is doubtful that one should try to incorporate the analysis 
of such games with, say, those of the opposite extreme: if player 1 prefers 
outcome x to outcome y, then player 2 prefers y to x; and if 1 is indifferent 
between x and y then so is player 2. In such cases, we say that players 1 
and 2 are strict adversaries of each other and that they have strictly opposing 
preference patterns for the outcomes of the game. If the players of a 
game have strictly opposing patterns among the lotteries of outcomes 
then we shall call it a strictly competitive game. 

Are there in fact such games? Or better, are there games for which this 
category is not too gross a miscarriage of the mathematician’s privilege of 
abstraction? One might be tempted to take war as the most extreme 
example of interest conflict, but at the global level it is probably not 
strictly competitive since both factions presumably prefer a draw to mutual 
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(see footnote 2, p. 29). 


Care must be taken in treating games whose outcomes require monetan 
exchanges as strictly competitive. Consider the following exam 

| requires player 1 to pay player 2 $2, and outcome y requires 1 to pa) 
| $500 with probability 1499, and nothing with probability 499%, 0. 
. B | upon the players’ attitudes toward gambling we may find both p1 

or both y to x, or one x to y and the other y to x. Nonetheless. ou; general point 
| remains that there are some games which can be usefully treated as if they ar 
strictly competitive, and equally there are others such as the labor-management 
game, the duopoly or duopsony game, the game of foreign trade between ty 
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1 knew what 2’s choice would be, there would be no difficulty in determin- 
ing his best counterchoice: 


Let us first examine this game from player 


If player 2’s choice is: Bi Bo B3 Bs 
Player 1’s best counterchoice is: @j Q@30r a4 Ae a4 
The return to 1 is: 18 4 8 25 


Since the counterchoice does in fact depend upon player 2’s choice, and 
since at this level of analysis that choice is not known, it is unclear what 1 
should do. It is, therefore, ‘‘natural” for player 1 to attempt to see 
whether his opponent’s analysis of the situation would lead to a specific 
strategy choice for 2 which 1 would counter with the appropriate response, 
e.g., if it turned out that 2 should clearly take 83, then 1 should take 


a>. Thus, player 1 is led to prepare a similar table for player 2’s best 
counterchoice: 


If player 1’s choice is: Q1 A. A3 Ag As 
Player 2’s best counterchoice is: 83 Bi Be Bs Bs 
The return to player 1 is: Se: ee Pe 


Again, player 2’s choice depends upon player 1’s choice, which would be 
unknown to player 2, and therefore at this level of analysis player 1 has 
no way of deciding upon the best strategy. We have a fully circular 
argument. 

On the other hand, these tabulations do suggest a possible resolution for 
player 1. From the second tabulation, he sees that if he takes a, he is 
assured an amount 0, similarly for a, and a5, but from a3 he can be sure 
of 4 and from a4 of 2. Let us say that the least amount he can receive 
from a strategy choice is the security level of that choice. Strategy a3 has 
the property that it maximizes player 1’s security level, i.e., by adopting a 
he can guarantee himself a return of at least 4, and no other strategy te 
guarantee a return of as much as 4. 

Let us repeat this same argument taking player 2’s point of view. Fro 
the first table it is seen that strategies 8), Bo, 83, and B, have aecuatiey tava 
for player 2 of —18, —4, —8, and —25 respectively; thus, 8. maximizes 
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Should player 1 choose a3? Certainly (ii) is not a very convincing 
argument if player 1 has any reason to think that player 2 will not choose 
Bo, which he might expect if he feels the above analysis exceeds 2’s capa- 
bilities. Also, (i) implies a rather pessimistic point of view; to be sure, a; 
does guarantee at least 4, but it also yields at most 5. He might reason 
that player 2 would be tempted to take 83, for the sum of payments in this 


column i ini nas 
isa minimum. Furthermore, 83 has the property that it is the 


best counterchoi ; 
1 would ad eee against strategy a4, and player 2 might anticipate that 
adopt a4 since it yields the m y 


can give a rationalizati aximum row sum. Thus, player ! 
as fess aeeeaan ee? choosing 83, and thus he would select a: 
; uld expect a return of 8. One may 
ee Player 2 might be aware of this 
So i Ich is best agai sae Instead of taking @3 he will actu- 
holding 1 down to a payment of 0. 
think-thatshe-thinke en tins this sort of “I-think 
Bike S*** ” reasoning to the point 
will gy ' Eve ©qually reasonable. 
alas him 4, that a though he comes to the realiza- 
1d a8 that a a4 _* pn hold him down to tha! 
FS not bel; € should pl 2 are in equilibrium, it is still a 
Y Bs, he Ef for any of numerous reaso™ 
Would be ill-advised to play %* 
aHempt to prescribe what he should 
Suarantee himself 4 by selectins 


4.4] An A Prio 


a3 and that no other choice has a high < 
the theory is careful to avoid sa} ing 
One might argue: if it is pointed out 
in equilibrium, then 1 should choose a: il 
theory saysso. If we were player 1 in 
a3, but we would not call another “irrational” 1 1 otherw Even 
if we were tempted at first to call an @3-no! ; 
would have to admit that player 2 might | 


would be “rational” for player 1 to 
1 
D 


yeCause we 


conformist. We belabor this point 
the social scientist recognize that game theory 1s not descriptive, but rather 
(conditionally) normative. It states neither how people do behave nor 
how they should behave in an absolute sense, but how they should behave 
if they wish to achieve certain ends. It prescribes for given assumptions 
courses of action for the attainment of outcomes having certain formal 
“optimum” properties. These properties may or may not be deemed 
pertinent in any given real world conflict of interest. If they are, the 


theory prescribes the choices which must be made to get that optimum. 


4.4 AN A PRIORI DEMAND OF THE THEORY 


In the preceding section we noted a particular property of the pair of 
strategies (a3, B2) which we may formalize as a demand to be met by any 
theory of strictly competitive games. It seems plausible that, if a theory 
offers a;, and 8;, as suitable strategies, the mere knowledge of the theory 
should not cause either of the players to change his choice: just because the 
theory suggests 8;, to player 2 should not be grounds for player 1 to choose 
a strategy different from aj); similarly, the theoretical prescription of aj, 
should not lead player 2 to select a strategy different from 8;,. Put in 
terms of outcomes, if the theory singles out (a; 3, By,), then: 


i. No outcome O;;, should be more preferred by 1 to Oj);o. 
ii. No outcome O;,; should be more preferred by 2 to Ojo3o. 


Any a;, and 8 ;, satisfying conditions (i) and (ii) are said to be in equilib- 
rium, and the a priori demand made on the theory is that the pairs of 
strategies it singles out shall be in equilibrium. 

There is no serious loss of generality if we replace the outcomes O;; by 
their utilities a;; for player 1 and b;; for player 2. Since the two players 
have strictly opposed preferences for lotteries of outcomes, the utility of 
the first person is in effect exactly the negative of the utility of the second 
one. The only ambiguity that exists is in the value of the units and zero- 
of these functions, which are not determined in the von Neumann- 
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its row ip. For example, in the game discussed in section 4.3, (a3, 8) is 


in equilibrium since the entry 4 in the third row and the second column 
is the maximum of its column and the minimum of its row. 

This notion of an equilibrium pair, though abstractly arrived at, is not 
BDU’ figment of the theoretical mind; it has its counterpart in such prac- 
tical affairs as battles. Haywood [1950, 1954] has explored the relation 
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It is easily seen that there is one equilibrium point: (northern route, 
northern route), with an expectation of two days of bombing. ‘These in 
fact were the choices made; the convoy was sighted about one day after it 
sailed; and the Japanese suffered severe losses. However, as Haywood 
emphasizes, “Although the Battle of the Bismark Sea ended in a disastrous 
defeat for the Japanese, we cannot say the Japanese commander erred in 
his decision.” [1954, p. 369.] Given the total strategic situation, which 
was not particularly bright for him, his choice was wise in the sense that 
his northern route strategy was at least as good as his southern route 
strategy against either one of Kenney’s strategies. 


4.5 GAMES WITH EQUILIBRIUM PAIRS 


There are a number of questions that come to mind about equilibrium 


pairs: questions of existence, uniqueness, and properties that they may 
possess. These we must examine. 


i. Do all strictly competitive games have equilibrium pairs? 


The answer is No. This is easily seen by exhibiting an array, [a;5] 

. * . . . > 

where there is no entry which is both the minimum of its row and the 
maximum of its column, for example, 


ai 
2.4} 


Thus, we are forced to divide the totality of zero-sum games into two 
classes: those which have an equilibrium pair and those which do not 
We postpone discussion of games of the latter type to sections 4.7 and 4 8. 
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The strategy a;, is said to be a maximin strategy since 79 has the property 
that it is a maximizer over 7 of the expression min a;;. 
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For player 2, an analogous situation obtains. Let max a;; be called 
i 


the security level of strategy B;, and so, to obtain the best security level, 


player 2 should choose 8; to minimize max a;;._ It follows readily that 2 
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can achieve this goal by choosing an equilibrium strategy 8;, and that 


this choice gives a security level of MAX 4; jo = Gioj) Thus, 
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To summarize, an equilibrium strategy not only attains the best security 


level for player 1 but it is also good against that strategy of player 2 which 
attains his best security level. 


v. If a strategy maximizes a player’s security level, is it an equilibrium strategy? 


* To see this assertion, we note that 
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Morgenstern [1947]). It will be recalled (section 3.2) that a game has 
perfect information if at any move the player has complete and unam)h 
ous information concerning the choices made at previous moves. This 
condition is sufficient, but it is by no means necessary; for a necessary and 
sufficient condition see Dalkey [1953]. 
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which maximize a 
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choosing B;. Not only do a2 and @; attain the maximum security levels, 
but B; is good against ag. ‘These two 
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a, than a2. But, if player 2 follows this argument, then he clearly will 
take By rather than 8;. ‘That being the case, player 1 should take ag, etc. 
This cyclic effect is all the more reason why both players should stick to 
the maximin and minimax strategies, in which case player 1 should defect 
to ai, etc., and we go around in circles again. 

Such an argument seems to force us to assert that player 1 should be 
indifferent between a; and ay. If so, then he should be willing to toss a 
coin to decide between them. Clearly, if player 2 chooses 61, then tossing 
a coin is preferable to a2; if 2 chooses B2, tossing the coin is preferable to 
a. But if 1 does not know whether #; or 2 will be chosen, should he 
prefer a toss of the coin in preference to either a; or a2? After all, we 
have argued ourselves into the position where we are indifferent between 
a, and a», so what help is tossing a coin to decide? The answer is that it 
raises the security level. Let us see how. 

If 8; is used, then player 1 receives 3 with probability }4 and 2 with 
probability 14, which is certainly preferable to a certainty of 2. If Be is 
chosen, then player 1 receives 1 with probability 44 and 4 with probability 
1. In this case the direction of preference is less clear, but we note that 
the expected value of the gamble is 5g. Can one say this is preferred to 2? 
Certainly there is no such assurance if we are talking about money, but 
this difficulty is avoided by assuming that these numbers are utilities, i.e., 
numbers which arose from the individual’s preference patterns over out- 
comes which may have been money or other stimuli. As we pointed out 
in Chapter 2, if the preference pattern over gambles satisfies certain con- 
sistency requirements, then the expected value of the utility function 
represents the preferences. ‘Thus, the gamble with the expected value 
54 is preferred to the sure thing of 2, even though in the gamble player 1 
may get as little as 1. If this were not so, we would have to conclude 
that the numbers 1, 2, and 4 are inappropriate numerical indices of the 
subjective worth—utility—of the outcomes they purport to reflect. 

Perhaps the argument that a chance event such as the toss of a coin 
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4.8 THE MINIMAX THEOREM 


The gist of this section is an informal statement of the central theorem of 
two-person zero-sum theory; in effect, it says that the analysis given in the 
last section of a specific example did not rest on any peculiar features of 
that example, but that the same properties hold for any zero-sum (strictly 
competitive) game. In Appendix 2 a self-contained and rigorous state- 
ment of this theorem is given, and if the reader desires more formality 
at this point he should turn to that appendix. 

Let us first recall the result that holds for strictly competitive games 


which have an equilibrium pair among pure strategies (sections 4.4 
and 4.5): 


There exists a number v, a pure strategy (a maximin strategy) for player 1 which 
guarantees him at least v, and a pure strategy (a minimax strategy) for player 2 
which guarantees that player 1 gets at most v. These pure strategies are in equilib- 
rium, and any pair of pure strategies which are in equilibrium yield a maximin and 
a minimax strategy for 1 and 2 respectively. 


Now let us consider all strictly competitive games with a finite number 
of pure strategies, whether or not they have equilibrium pairs. It is 
clear that we shall want to generalize the notion of a mixed (or random- 
ized) strategy as given in the last section. If player 1 has m pure strategies 


@1, 2, * * * , @m, then a mixed strategy is a probability distribution over 
m points, i.e., it is a set of m numbers x1, x2, * * * , Xm such that 
eM, FOR Mee.) eee 1772, ANG. Aa tte ce tt” ate Le 


The mixed strategy can be symbolized by (xia, xa, * - + , XmQm). 
A similar definition holds for the mixed strategies of player 2, except that 
the probability distribution is over n points. 

The principal theorem—known as the minimax theorem—asserts that 
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The primary assumption leading | 
maximin strategies should be the s 


it requires that the absolute level of a person’s w y 11 nang 

when a constant amount is added to each payoff entry—shall not result in a 
utility payoff which alters his strategic considerations. Asa normative condition, 
this is acceptable and interesting; as a description of behavior, it is very doubtful. 
Absolute levels of wealth do appear to influence behavior. < 


4.9 COMPATIBILITY OF THE PURE AND MIXED 
STRATEGY THEORIES 


Although there is no conflict of interest among the mixed strategy 
equilibrium pairs nor among the pure strategy equilibrium pairs when 
they exist, one might fear that there would be between the two classes of 
equilibrium pairs. In other words, one might fear that the mixed strategy 
theory and the pure strategy theory would differ for those games having a 
pure strategy equilibrium pair, that the optimal security level using mixed 
strategies would differ from that using pure strategies. It can be shown 
that this is not the case. Furthermore, the pure strategy equilibrium 
pairs are also mixed strategy equilibrium pairs; hence, with respect to 
the best security levels and equilibrium pairs, no complications result by 
introducing mixed strategies into games with pure strategy equilibrium 
pairs. 

It remains true, however, that even though a player has a pure equilib- 
rium strategy and realizes it, he may have reasons to play differently if he 
thinks that his opponent may not play an equilibrium strategy. One may 
depart from an equilibrium strategy to exploit an opponent’s supposed 
ignorance, but it may be sensible to use a mixed strategy as a hedge 
against extremely unfavorable situations and against the possibility that 
one’s opponent has more insight into one’s behavior than anticipated. 

It will be recalled that we have gotten to this point by the following 
ay cae “d Mae oa we set up an a priort demand of any theory 

y selections, and we showed that this meant that 
an acceptable theory would have to single out equilibrium pairs. In 
section 4.5 we established, first, that not all strictly competitive games have 
pure strategy equilibrium pairs, and, second, that, in those games which do 
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4.10 ON THE INTERPRETATION OF A MIXED STRAT 


What we have just related is certainly mathematically above reproach, 
but the skeptical reader may question whether at possesses any meaning 
conceptually. What does it mean to select a mixed strategy and would 
one ever really choose one? 

As to the meaning, this depends very greatly upon one’s interpretation 
of probability. We shall take the point of view that the selection of a 
pure strategy by means of a mixed strategy is equivalent to performing an 
experiment. Let us suppose player 1 has the pure strategies aj, a2, °° °, 
m, and let x be the symbolic representation of the mixed strategy where 


Player 1 adopts one and only one pure +: 
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experiment x‘) player 1’s securi 
yields some pure strategy, say 
level is changed to v(a¢). 

Now, if we are motivated solely by securi 
we may be quite unhappy with the 
that some other strategy, say ag, has a much higher security level than 
ag—and so we may be tempted at f2 to adopt a pure strategy different from 
the one dictated by the spinner! But, if we are going to tamper with our 
fate as given by the experiment, why be so silly as to construct and perform 
the experiment in the first place? After all, if we are cognizant that we 
will not abide by the experiment x‘"’, then our security level is not v(x‘) 
from to to ¢4. 

Indeed, let us suppose that the player is offered one of two options: 
In option 1 he does not have the privilege of ignoring the dictate of the 
spinner, and in option 2 he need not follow its dictates. Since option 2 
includes all the possibilities available to him under option 1, plus others, 
it would seem that it should be preferred. In that case, a critic of the 
randomized strategy concept can argue that between ¢; and f2 the player 
knows what pure strategy the spinner has selected, and so the mixed 
strategy x‘ used to arrive at it is strategically irrelevant. One must 
compare that pure strategy on its own merits with the other pure strategies. 
Consequently, the concept of a mixed strategy is a convenient mathe- 
matical tool but it completely fails to be realistic. 

The most common counterargument in defense of mixed strategies is 
the observation that they withhold from our opponents knowledge of the 
pure strategy we will use. ‘This, it is contended, is important, for such 
knowledge can be exploited. Although granting that this is often a perti- 
nent point, some authors feel that it certainly cannot be a complete 
defense, for mixed strategies are appropriate even when we know that the 
other player is not in any way concerned with one’s behavior. This may occur 
if the other player is not aware of the game payoffs, or in games against 
nature. But, if the defense of mixed strategies should not be confined 
solely to a secrecy argument, what should it be based on? The supple- 
mentary defense is that, psychologically, option 1 should be preferred to 
option 2, contrary to our assumption above, just because it does not per- 
mit us to fall prey to our human frailty. It is not unlike the person who 
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It will be helpful to employ the pictorial representation shown in Fig. 1. 
Let each point on the horizontal axis represent symbolically a pure or 
mixed strategy for player 2, and let the vertical axis represent the payoffs 
to player 1 when he employs the strategy x. Thus, to each y for player 2 
there is a value M i(x, y) for player 1; therefore for each x we can draw 
a curve representing the payoff to 1 as the strategy for 2 varies over its 
domain, Each such curve is associated with a specific strategy x, so when 
we wish to consider more than one strategy for player 1, we shall have to 
Se i several curves, each labeled by its associated strategy. 

In Fig. 2, strategy x‘) is clearly as good as x‘) for all values of wet 
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guarantee himself a security level larger than 0, say 49. We may suppose 
that he governs his choice at the following move to squeeze out this 
advantage. Similarly, at each later stage he can calculate his security 
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2 induces player 1 to choose g at move 4 and then responds by selecting 
j and playing the game Iz impeccably. We shall return from time to 
time to the psychological problems which beset the shrewd player pitted 
against an antagonist who is shrouded in some mystery; however, this 
topic is best resumed under such headings as game learning in the non- 
strictly competitive case and games against nature. 

It should be pointed out that our using the extensive rather than the 
normal form for this discussion was not accidental. It is true that the 
reduction to the normal form leaves the strategic aspects of the game 
intact, but often it does not seem best suited to dealing with pertinent 
psychological, i.e., descriptive, information which we may have. 


*4.12 A GUIDE TO THE APPENDICES ON 
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We believe, and probably most of our colleagues would agret, gies 
many important and interesting games will never be solved. This does 
not imply that game theory will never contribute anything to the under- 
standing of these realistic games. Often, a modus operandi for a complicated 
case is to consider an auxiliary game which is motivated by and related 
to the original one in such a way that many of the important phenomena 
of the original are retained while the auxiliary remains solvable. From 
the solution of the auxiliary game one speculates informally how the 
results are modified in the original game. Thus, for example, there are 
simplified variants of both poker and bridge in the literature (see Bell- 
man and Blackwell [1949], Bellman [1952 6], Gillies, Mayberry, and von 
Neumann [1953], Kuhn [1950 6], and Nash and Shapley [1950]). Such 
studies are in much the same spirit as economic analyses of idealized 
Robinson Crusoe and Swiss Family Robinson economies which, by means of a 
lot of hand waving, are used ‘“‘to explain’? economic phenomena and to 
reach policy decisions concerning the economy at large. This is danger- 
ous, yes! Yet it is quite stimulating to our creative intuitions and often 
helpful in purely literary, pseudological (mot said deprecatingly, but 
rather pragmatically) theorizing. 

Contentwise, Appendix 6 begins with a discussion of the trial-and-error 
technique. A few guide posts are suggested for those who have occasion 
to indulge in this guessing game. Next, a rough geometrical (Appendix 
3) explication of the Shapley-Snow algorithm is given and its relation to 
the double description method is explained. The simplex method—the 
most common way to solve linear-programing and game problems—is 
described, and its relation to the dual simplex method is illustrated 
geometrically. Finally, iterative techniques for finding solutions are 
given, €-8:5 a differential equations approach to equilibrium is examined. 
oe oe : games by fictitious play, due to Brown and von 

pao particular interest here since it has conceptual 
overtones for a descriptive theory of games. Brown [1951] states: “Th 
iterative method in question can be loosely characterized by the fait = 
it rests on the traditional statistician’s philosophy of basing future deci- 
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In the final appendix we surve) 
two-person games in exte 


stitutes one of the most active 


;. A succession of stages 01 

ii. At each trial the players 
choices. 

iii, At each trial the players make simulté 

iv. The number of trials, finite or infinite, may or may not be fixed in 
advance, but usually the length of play is determined by chance ana by 
the actual sequence of player’s choices. 

The strategic problem at each trial can be visualized as a two-person 
game in normal form, which itself is treated as a single component of the 
dynamic supergame. 

In the class of games known as stochastic, the strategy choices in a com- 
ponent game not only determine the positional payoff—an exchange of 
money or goods—but also control the probabilities governing which 
component game is to be played at the next trial, if any at all. The 
structure is so restricted that the play is almost certain to terminate in an 
undetermined but finite number of trials. In a recursive game payoffs are 
not made during the play; rather they occur at termination if the play is 
finite. For plays of infinite duration a convention is adopted for assigning 
payoffs. Such “real world” problems as games of (military) survival, 
attrition games, economic ruin games, dynamic programing, or com- 
pound statistical decision problems (e.g., classifying a stream of subjects) 


fall—admittedly a little unnaturally at times—into these categories of 
dynamic games. 


4.13 SUMMARY 


In this chapter we have covered what is probably the most central 
aspect of game theory: the existence of pairs of equilibrium strategies in 
two-person zero-sum games. A strictly competitive (or, equivalently, zero- 
sum) two-person game is one in which the two players have precisely 
opposite preferences. It is, therefore, a game in which cooperation and 
collusion can be of no value. Any improvement for one player necessi- 
tates a corresponding loss for the other. The term “zero-sum” is used 
because it is possible to choose the zeros and units of the two utilit 


; func- 
tons so that they always sum to zero. Such games are most rather 


represented by matrices: the rows representing player 1’s pure strateg 
choices; the columns, player 2’s strategies; and the entries, the wanes 
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| It was shown that not every zero-sum two-person game has a (pure 
strategy) equilibrium pair, and that when one does exist it is not neces. 
sarily unique. The non-uniqueness does not, however, generate a new 
conflict of interest, for equilibrium pairs are equivalent in the sense of having 
tH the same utility payoff, and equilibrium strategies are interchangeable in 
2 that any pair, one from each player, forms an equilibrium pair. 
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: i, he equilibrium Strategy maximizes a player’s security level, and, 
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worth, for, if in the final analysis : 

the “‘best” pure strategy be select 
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use them, but what if one fails to? It was pointed out that the 


player might profit by also deviating from an equilibrium strategy, but 
that this carries with it the risk of a more serious loss than could occur 
with an equilibrium strategy. 
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Twoerson NON-ZERO-suM 
NON-COOPERATIVE GAMES 


5.1 INTRODUCTION 


A non-strictly competitive game is exactly what it Says: a game which 
fails to be strictly competitive because t 
L and L’ over the Outcomes of the g 


ame such that one player prefers 
L to L’ and the other does not prefer ZL’ 


toZ. For such games it is impos- 
Ctions of the players so that they sum to zero; 
€ terms “non-strictly Competitive” and “non-zero- 
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complexity of this penumbr 
mathematical model, 
among economists, sociologist 
should give them pause wh 
explanation. 

In strictly competitive games i 


mutual benefit by any form of cooperatior 
petitive games such mutual gz 


te) 


forced to consider explicitly whether or not the players are permitted to 


ri 


cooperate. We shall only examine the two most extreme assumptions. 


freedom of preplay communication to make joint binding agreements. In 


By a cooperative game is meant a game in which the players have complete 


a non-cooperative game absolutely no preplay communication is permitted 
between the players. The latter games with two players are our present 
topic; the two-person cooperative games will be taken up in the next 
chapter. However, even in this chapter we shall from time to time 
attempt to indicate the differences preplay communication introduces. 
As before, we denote the players by 1 and 2, their respective strategy 
sets by A = {a1, -* * , am} and B = {B, - - - , Bn}, and the outcome 
associated with (a;, 8;) by O;;. We assume that each player has prefer- 
ences among mixtures of outcomes which lead to a linear utility function; 


let a;; denote the utility of outcome O,; for player 1, 6;; for player 2. This 
leads to a table of the form: 


Sa ee B; ees 


ay 
a2 


a; ree ofa a bis) 


Am 


Unless otherwise stated, we shall assume that both players are aw 


are of all 
the data contained in the above table. 


a2 REVIEW OF THE SALIENT ASPECTS OF ZERO-SUM GAMES 


It will be useful once again to summarize some of the important proper- 


ties of the strictly competitive game, for these will be found not to hold in 
Some non-zero-sum games. 
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M(x, y) = — M(x, y)s 


: term zero-sum. 
Mie caer * following properties of zero-sum games: 

i. It is never advantageous to inform your opponent of the (pure or 
mixed) strategy you plan toemploy. (Of course, if a player plans to use 
an equilibrium strategy, his security level is not diminished by disclosing 
his intentions—but nothing is gained by the disclosure.) 

ii, It never benefits the players to communicate prior to the play and 
to decide upon a joint plan of action. 

iii. If (x, y) and (x’, y’) are both in equilibrium, then: 

(1) y’) and (x’, y) are both in equilibrium, 
and 


(2) M(x, y) = M(x’ 


aa »¥) = M(x, y’) = M(x’, y), 
M2(x, y) = 


OWeTE xis am 
Is an equilibrium 


M(x’, y’) = M(x, y’) = M(x’ y). 
yandya 
, and Conversely, 
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minimax Strategy, then (x, y) 


5.3] eee 


Various interpretations arc possibl. 

call it the “‘battle of the sexes”: 

each have two choices for an evening 

go to a prize fight (a; and 1) or toa! t (a 


usual cultural stereotype, the man n 
the ballet; however, to both it is more important that tney 80 ' ogetnet 
than that each see the preferred entertainment. Lett 


game possesses any of the four characteristics of zero-sum games. 


The power of disclosing one’s strategy. Player 1 would | 


tent with (a1, 81) whereas 2 prefers (a, 82). If 1 announces that he plans 
to choose @, and that no arguments will alter his choice, and if 2 has faith 
sn 1’s stubbornness in sticking to his announced intentions, then she has no 
alternative but to choose 6;. A similar argument holds if 2 announces 
her intentions first. Thus, we see that it is advantageous in such a situa- 
tion to disclose one’s strategy first and to have a reputation for inflexibility. 
It is the familiar power strategy: ““This is what I’m going to do; make up 
your mind and do what you want.” If the second person acts in his own 
best interests, it works to the first person’s advantage. 

Preplay jockeying and its effect on utilities. In connection with the 
preceding point, we should like to recognize another phenomenon which 
will play havoc in much of the subsequent discussion; however, having 
raised it, we wish to de-emphasize it for now and to return to deal with it 
more fully later. 

If, in the preplay discussion, the man says he is already committed to 
the prize fight and demonstrates his intention of going by producing the 
ticket he has already purchased, this may cause the woman to submit to 
his will, as argued above. But, to some spirited females, such an offhand 
dictatorial procedure is resented with sufficient ferocity to alter drastically 
the utilities involved in the payoff matrix. Preplay communication is 
considered outside the game structure of the payoff matrices, yet in some 
cases it may result in a radical change of one player’s preference pattern 
and therefore of the payoff matrix. In such cases we could, perhaps, 
enlarge the space of strategies and complicate the game to include the 
preplay negotiations. Later we shall return to such points, but for 


now we shall suppose the payoff matrix remains invariant during the 
negotiations. 


Some complications in the equilibrium concept. Continuing with 
the same payoff matrix, we note that both (a, 81) and (ag, B2) are 
equilibrium pairs, since each strategy in one of the pairs is best against the 
other in the same pair. However, neither (a, 82) nor (a2, 61) is an 
equilibrium pair. Furthermore, (a1, 81) and (a2, Bz) do not yield the 
same returns to the players. Note well how completely these observations 
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associated with each x.” After some calculation, 1 finds that his maxi- 
min strategy is (2401, 342) and that this results in a security level of ls. 
Furthermore, he sees that if 2 selects 6; the returns are (14, —1<) and 
with 82 they are (14, 44). 
Player 1 continues hi esi : 
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2, and here we go again.” 
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$1 to the player; doubl. 

player selects safe; and a 

up $.05 to the house. 

announce their choices sin 

house wants both to use the 

a fair game to the contestants sinc 


The game is played only once by contestants who have never met 
should be cheap advertising! 
* Bis C aT r it ee as ~ (eee ie 
A geometrical representation of the game. Another way to sce tle 


complexities of the non-cooperative version of the game we have been dis- 


cussing is to make a geometrical plot of the possible payoffs. Along the 


(1, 2) 


Player 2’s utility 


Player 1’s utility 


(-1, —1) 
Fic. 1 


horizontal axis we plot player 1’s utility, and along the vertical, player 2’s. 
| Only certain combinations can arise; these are shown as the shaded area 
| of Fig. 1. To each pair of mixed strategies (x, y) there corresponds a 
| payoff which is one of the points in the shaded region; conversely, to each 


point in the region there corresponds at least one pair of strategies with 
this point as payoffs. 


> It is worth noting that the pair of mixed strategies [(3$a1, 24a), (2681, 36B2)] 
is a (symmetric) equilibrium pair with a payoff of (14, 14). There is little reason 
Owever, to expect the players to choose these strategies, for if 2 were to chingae 
7681, 3482), then all of 1’s strategies are equally good against it in the sense that 
they all have an expected return of 14, and player 1’s maximin Strategy (24a, 
$02) guarantees him 1% against any strategy 2 might select. Thus, the masinaie 
| Strategy seems preferable to the strategy of the symmetric equilibrium pair; yet 
it produces the complications we discussed above. Although this seemingly 
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non-cooperative game. 
Cooperatively, it is C 
and an “equit 
ligne (a1, Bi) is jointly chosen, and tails that (a2, 62) is jointly 
chosen. In the language of our interpretation: heads, the couple goes to 
the fight; tails, to the ballet. The utility for this joently arranged random- 
ized strategy is (14)2 + (19)1 = 34 for each player. Note, however, 
that in the non-cooperative context a return of (34, 34) is never possible— 
it lies outside the shaded region of Fig.1. ‘The strategy which randomizes 
between (a1, 81) and (a, Be) can never be achieved if each player randon- 
izes his strategies independently—which is exactly what must occur in the 
ee rnc context. Thus, contemplating. the cooperative situation 
T ysvustrating for those prohibited from preplay communicati 

*emporal collusion. Before we turn t a unication. 
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The following interpretation, known as the prisoner's dilem! 
Two suspects are taken into custody and separated 


is certain that they are guilty of a specific crime, but he doe: 

adequate evidence to convict them at a trial. He points out to cath 
© ig 4 the crime th | lice 

prisoner that each has two alternatives: to confess to the crime the polic 


are sure they have done, or not to confess. If they both do not confess, 
then the district attorney states he will book them on some very minor 
trumped-up charge such as petty larceny and illegal possession of a 
weapon, and they will both receive minor punishment; if they both con- 
fess they will be prosecuted, but he will recommend less than the most 
severe sentence; but if one confesses and the other does not, then the con- 
fessor will receive lenient treatment for turning state’s evidence whereas 
the latter will get “‘the book” slapped at him. In terms of years in a 
penitentiary, the strategic problem might reduce to: 


Prisoner 2 


Prisoner 1: Not Confess Confess 
Not Confess] 1 year each 10 years for 1 and 
3 months for 2 
Confess 3 months for 1 and 8 years each 


10 years for 2 


If we identify a; and 8, with not confessing and a2 and 8» with confessing, 
then—providing neither suspect has moral qualms about or fear of squeal- 
ing—the above payoff matrix in utilities has the right character for the 
prisoner’s dilemma. The problem for each prisoner is to decide whether 
to confess or not. The game the district attorney presents to the prisoners 
is of the non-cooperative variety. 


Another version of this payoff matrix which will be intuitively more use- 
ful in some of the following discussion is 


Bi Bo 
H: a) ie 5) (—4, 6) | 
rg tO. 4) (3, = 3) |. 
This will be given the interpretation that an 
loses $4 and player 2 receives $6, and we s 
Wishes to maximize his monetary return. Note that if we take the utility 


of money to be linear with money and set the utility of $6 to be 1 and of 
—$4 to be 0, then the game G results from H. 
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Sela try to argue that the differences between 1 and 0.9 and 
between 0.1 and 0 are so small that even a criminal’s ethics would make 
him select the first strategy so that they would not both be Caught in the 
“stupid” (0.1, 0.1) trap. Such an argument is inadmissible since the 
numerical utility values are supposed to reflect all such “ethical” con- 
siderations. No, there appears to be no way around this dilemma. We 
do not believe there is anything irrational or perverse about the choice of 
@2 and 82, and we must admit that if we were actually in this position we 
would make these choices, 

Perative aspects, [et us sup 


could Cooperate; then it is cl me ft the Players of the game 6 


far that they would enter into a binding pact 


St if a < 
F en he Profits; whereas, if h Player defects and his opponent does n0t, 
OS€s more t asc "4 € fails to defect and his opponent does, h¢ 

Cont 
ick : Meh a “dou cro; : were both to defect. Within the criminal 
to iy: * d that it Would not 1, “nSender serious reprisals and s0 I! 
ignored su thay terpret tion Se while. This seems, howeveh 
i Onsj . i 0 ° ; 
better include : bp nations i abstract: € given Numbers, If we havé 
tan enlar € reaking of a bing. 8 2 game from reality, we had 


Alt ; am indin 
netely, We can Purportin t = agreement as an integral aspect 
'S 80 di Mmarj a + eal 
© disastroy, PPose tha : See Gonflict of i ntees 
. Of Dreakj binding agte 
Conside ng a binding. 
Ted 


Pxthe, 
=e ieee : jce- 
a>, B.)> solve the am learly (a), B;) is the cho 

© think a Cooperative game in any W*' 


ne 
© hopelessness that ° 


feels in such a game as this can: 
“rational” and “‘irrational”’ 
should be a law against such games 
one essential role of governme 
social “‘games’” must be chang: 
situation that the players, in pursuins 


socially undesirable position. That 
exist is illustrated in the next paragraph. 
An alternative interpretation. As an n-person analogy to the pris- 


oner’s dilemma, consider the case of many wheat farmers where each 
farmer has, as an idealization, two strategies: “restricted production” and 
“full production.” If all farmers use restricted production the price is 
high and individually they fare rather well; if all use full production the 
price is low and individually they fare rather poorly. The strategy of a 
given farmer, however, does not significantly affect the price level—this 
is the assumption of a competitive market—so that regardless of the 
strategies of the other farmers, he is better off in all circumstances with full 
production. Thus full production dominates restricted production; yet 
if each acts rationally they all fare poorly. 

In practice the equilibrium may not occur since the farmers can, and 
sometimes do, enter into some form of weak collusion. In addition, a 
farmer does not play this game just once. Rather it is repeated each 
year and this introduces, as we shall see in the next section, an element of 
collusion. Finally, sometimes the government feels as we do, steps in, and 
Passes a law against such games. Of course, in this analysis we have 
neglected the consumer. When he is included collusion may not be 
socially desirable even if it is desirable for the farmer. 


5.5 TEMPORAL REPETITION OF THE PRISONER’S DILEMMA 


Let us consider a game which is analogous to the prisoner’s dilemma in 
the sense that it has the same strategic one-play aspects but which can be 
meaningfully repeated in time. Let it be of the form 
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and suppose that it is repeated successively in time. 


What we are a: 


1e1r 


ference. 
pre does reflect th 


ility notion, consi 


tl 
e of the u a 
ization of their own expect 


payoff matrix represent m 
have in one way OF an¢ ia 


not refl 
the following discus 
ross an abuse 
din the maxim 


bers in the 


seems too § 
only interest€ 
and let the num 


layers 
ecting (@1, P1/- + be temr uaa 
8 a will “probably” choose (1, he a bet jueeze 
at his o : 9. Oweve! and 
ss more out of the next game by choosing &2 . pie. 
aly should—anticipate that the — Pe Ka2, b * SaKe 
: ; is driven to play @2 1n that game, and 
By in the next game, in which case he play re: 


so in total he will lose more than is compensated for by a prens 
Thus, we may argue that his contemplation of the resulting ae tends to 
keep 1 in line, and, if he is unable to reason so clearly about the future, a 
little experience should soon set him straight. From these arguments, we 
see that in the repeated game the repeated selection of (aj, 8;) is in a sort 
of quasi-equilibrium: it is not to the advantage of either player to initiate 
the chaos that results from not conforming, even though the non-con- 
forming strategy is profitable in the short run (one trial). 

It is intuitively clear that this quasi-equilibrium pair is extremely 
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choices, and the payoff is merely the sum of pay 
ponent games.’ A strategy for player 1 in this new game is complex 
since it must prescribe what choice 1 is to make on each move and it may 
explicitly take into account the past history of all preceding moves. 
Complicated or not, it is certainly possible to consider the supergame in 
normalized form where each player merely chooses one overall strategy 
which dictates his full behavior pattern. It turns out that any equilibrium 
patr of strategies will result in the repeated selection of a2 and B2 on each of the 100 
moves. Furthermore, repeated use of ag and B, are maximin strategies. 
Therefore, if one wishes to maximize his security level he should play the 
second strategy in each component game, and this strategy is reinforced 
by being in equilibrium. Although any equilibrium pair will result in 
the repeated use of a2 and Bo, this does not mean that each player has a 
unique equilibrium strategy. For example, an equilibrium pair results 
when each player adopts the strategy of playing a2 or 2 for the first 99 
moves and then changes to a; or 6; on the 100th move if his adversary 
has used his first strategy for the first 99 moves. There are many other 
such pairs. The point, however, is that, even though there is not a unique 
equilibrium pair, there is a unique equilibrium outcome. 
An indication of the technique used to prove this result will prove use- 


ful, for the method applies to other situations. Let us list the overall 
strategies of the supergame as: 


Aa {ri, Sy Se age aes aia B, = {s1, 525 r » Snx}> 


| where J, is a finite but fantastically large number. One first notes that 

| some of 1’s strategies may be strictly dominated, i.e., 
pair r; and r; such that r; is never worse than r; no matter what strategy 
2 employs and for some of 2’s strategies it is better. These dominated 
strategies may be thrown away with no loss to 1, leaving a new pure 
strategy set Ay with No < N, strategies. Similarly, player 2 has some 
strategies which are strictly dominated relative to A 1 and these may be 
thrown away leaving a set By with (because of the symmetry of the super- 

. game) N» strategies. Now, as long as 1 knows that 2 will confine himself 


there is at least one 


*See footnote on opposite page. 
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If we were to play this game we would not take the second strat 


every move! 

Let us see why. Denote by as” the strategy in which ap is used 
exclusively from trial i on to trial 100, and for trial 7 < 2, a is used if and 
only if player 2 has used 6) on trials 1,2, ---,j7— 1. In other words. 
i use a so long as 2 uses 8; olay trial 2, whichever comes first, and 

1 . *y.° . 

after that we use a». Clearly, a» is the equilibrium strategy of playing 
a2 on each of the 100 trials. In a similar way, BS” is defined. Now if 
player 1 knows that 2 plans to use BS”) for k > 1 h si : od 
- oneal 2 , then his best response is 
Babar tintthe va x a worst response is the equilibrium strategy as) 
ust select one of the i , | ai 
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This game has the same qualitative characteristics as H in that the second 
strategy dominates the first for each player; however, there is a much 
stronger temptation to play the second strategy. In this case a strategy 
like a$’, literally interpreted, seems “‘reasonable.” 

Let us recapitulate. If H is played but once, we feel that it is ‘‘reason- 
able” to single out (a2, B2) as the “‘solution” of the game provided that 
there is no preplay communication. By reasonable we mean that we 
predict that intelligent players will play accordingly and, furthermore, 
that they will stz// do so even after a full airing of the “theory” of such a 
game. ‘True, players will find themselves completely frustrated; none- 
theless they have no real alternative. (Incidentally, in some contexts, if 
the two players are frustrated, it may be beneficial to society.) In con- 
trast, we do not think it “reasonable” to single out (a$!, 6$”) as the 
“solution” when #H is iterated n times, even though an equilibrium point 
results from this type of behavior and even though each of these strategies 
is undominated in the wide sense. It is not “reasonable” in the sense 
that we predict that most intelligent people would not play accordingly. 
Unfortunately, we know some individuals who, although brilliant in other 
ways, insist they would choose a@$!. Yet, we feel that, as long as our 
“subjective a priori probability” for our opponent selecting B$” is less than 
1, we should not single out a§”. We feel that in most cases an unarticu- 
lated collusion between the players will develop, much in the same Way as 


_ 4 mature economic market often exhibits a marked degree of collusion 


without any communication among the participants. This arises from 
the knowledge that the situation will be repeated and that reprisals are 
possible. One cannot help reflecting that, unfortunately, 


and political spheres the participants all too often ha 
ordentation. 
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j Flood [1952] has performed some empirical work on the 100 
tion of the above game. His running dialogue of the 
move reactions is amusing as well as instructive. 


The iterations we have been discussing are of a somewhat special 
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ond, to let the number of trials be a random variable i e bldg 
bution of the exponential type, 1.€., the conditional proba vility of « xactly n 
more trials, given that & trials have already occurred, is independent of k, 
An equivalent formulation of this stopping rule is to terminate the play at 
each trial with probability 1 — p and to repeat the game again with 
probability ~ 1. Games of this general type, which are known as 
stochastic games, are studied in Appendix 8. If, in the first case, the 
discount rate is not too small and, in the second, the “repeat” probability 
p is not too small, then it is possible to show an equilibrium pair exists 
which results in the repeated use of the first strategy by both players. 


The third method assumes that the number of trials is fixed but unknown 
to the players. This last sug 
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zero-sum and non-zero-sum games. For zero-sum games, te repeated 
use of strategies which are optimal in the small constitutes a st which 
is optimal in the large; for non-zero-sum games, the re] ise of 


strategies which are optimal in the small might be unrealistic in the large, 
although if both players choose these strategies they are in equilibrium. 

Actually, when a zero-sum game is iterated one is in a position to gain 
and exploit statistical information about one’s opponent. Clearly, this 
is important in life, and it emphasizes the difference between a descriptive 
and normative theory. An example may point this up. Consider the 
zero-sum game of matching pennies in which each player has two pure 
strategies: heads (a1, and ;) and tails (ao, and B2). If they both choose 
the same strategy, 1 receives $1 from 2; if they differ, 2 receives $1 from 1. 
The money payoff matrix for player 1, therefore, is: 


Bi Be 
cal 1 -1 
ag|—1 ifs 


Let us assume that each player attempts to maximize his expected mone- 
tary return. Player 1’s maximin strategy places a probability of 16 on 
each strategy; this yields an expected return of zero no matter what 2 does. 
Thus, even if 2 fails to play his minimax strategy this deviation is not 
exploited by 1s maximin strategy. This is the story we outlined in sec- 
4.11. If the game is iterated, 1 can obtain sequential information about 
his opponent’s strategy and, if it appears not to be optimal, then he can 
attempt to exploit the deviation. To take an extreme example, suppose 
2 is playing heads with probability 34; then 1 should play heads with a 
probability greater than 44. Mathematically, at least for a single trial, 
his best choice is heads with probability 1; however, in an iterated game 
2 would soon spot this strategy. Player 1 must try to exploit 2’s blunders 
without, however, teaching him the error of his ways. If 1 judges his 
opponent to be shrewd, but not as shrewd as he, a more subtle tactic might 
be used: 1 departs slightly from optimality, and he lies in wait knowing 
that ultimately 2 will notice it and attempt to exploit it. When 2 alters 
his strategy, 1 detects the change quickly since he is anticipating it and, 
since this strategy cannot be optimal, 1 subtly changes his strategy 
accordingly. When 2 again catches on, 1 is ahead. 

Such considerations as these seem realistic and would have to be encom- 
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be a non-cooperative 7 ‘can expect 
: is to suggest a strategy or class of strategies for each of the players. 
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yet if the pair of strategies chosen is not in equilibrium there are reasons 
for the players not to act in conformity with the theory (see section 4.4), 
But since in the example (a1, 61) and (a2, 82) are both in equilibrium, 
yet (a1, 82) and (a2, 6;) are not, what can the theory suggest to the 
players? 

One might expect, however, no difficulty if each player had but one 
equilibrium strategy, as in the game 


ie 0.9) (0, 1) 
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5.7] The Role of Equilibriu 
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We may also add that, even if it is possible to produce pathological 
examples which throw doubt upon the universality of a concept, this does 
not necessarily undermine its importance. It merely establishes that 


care must be exerted to check whether the concept is plausible in the 
specific cases to which it is applied. Ideally, one should attempt to 
investigate the mathematical restrictions which should be placed on the 
domain of admissible games so that the concept is plausible. In the case 
of the equilibrium point concept for non-cooperative games, we know 
that several major difficulties exist; nonetheless, it is an exceptionally 
important tool for the analysis of wide classes of economic games. 

Even if we were to decide to reject equilibrium points as a normative 
theory for non-cooperative games (and remember there is no real alterna- 
tive) it may still be that the notion is relevant in a description of behavior. 
Although not “‘all life is a game,”’ at least not in our sense, we cannot fail 
to recognize that people are constantly jockeying to better their lot in a 
manner which is quite analogous to playing in an extremely complicated 
many-person game. For a given society, a set of mores and patterns of 
behavior gradually build up and then remain stationary for long periods 
of time; yet another society, with approximately similar initial conditions, 
will evolve to a quite distinct pattern of cultural norms. Loosely speak- 
ing, we may regard these as two possible equilibrium “solutions” to this 
“game.” They are equilibria in the sense that an individual usually 
finds it disadvantageous to buck the tide of society’s opinion. It is our 
impression that players of a game do, in some sense, evolve to an equilib- 
rium position—not necessarily a unique one— and we can say that, from a 
descriptive, though not a normative, point of view, the set of equilibrium 
points of a game do constitute a “‘characterization of the solution” of the 
game. Although the following is not grounded in any specific empirical 
studies, we can imagine the participants of a complicated game flounder- 
ing about using trial and error methods to arrive at a suitable mode of 
behavior and finally settling down to a pattern which is not in any sense 
a “social optimum,” but which nevertheless is in equilibrium, since it does 
not profit any player individually to pioneer in new directions. Much of 
the n-person theory we shall discuss, if it is to be interpreted descriptively, 
must be interpreted in this manner. 
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5.9] Definitions of “Solutio: 
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get. 
In the game 


| 
( ama 
every pair of pure or mixed strategies is in equilibrium and so it is auto- 
matically Nash solvable. Observe that a player’s strategy choice has 
absolutely no influence on his return; this is entirely governed by his 
opponent’s choice. Such a game is particularly frustrating since all it 
amounts to is this: 1 can give 2 from one to three (utility) units of satis- 
faction while remaining completely indifferent himself among his choices; 
similarly, 2 can give 1 from one to two units of satisfaction while remaining 
completely indifferent among his choices. Player 1’s upper and lower 
values are 2 and 1, 2’s are 3 and1. Naturally, if the players could com- 
municate they would make a binding agreement to adopt (a1, 82) yielding 
(2, 3), but in the non-cooperative context all strategies are indifferent for 
each of the players. In cases where the equilibrium concept does not 
lead to a unique mode of behavior, the players probably do well to con- 
template the cooperative game. 

A strategy pair (x, y) is said to be jointly inadmissible if there exists a 
strategy pair (x’, y’) such that each prefers the latter to the former, i.e., 


M(x’, y’) > M(x, y) and M?(x’, y’) > M2(x, y). 


In this case, (x’, y’) is said to jointly dominate (x, y). A pair (x, y) is 
jointly admissible if and only if it is not jointly dominated by another pair. 
A non-cooperative game is said to have a solution in the strict sense if: 


i. There exists an equilibrium pair among the jointly admissible 
strategy pairs. 


ii. All jointly admissible equilibrium pairs are both interchangeable 
and equivalent. 


The second condition prohibits confusion in the case of non-unique 
jointly admissible equilibrium pairs. 
The pairs (a, 81) and (a2, B2) are in equilibrium in the game 


bia 1) (0, e 
(0,0) (2, 2)], 


but they are not interchangeable, so the game is not Nash solvable; how- 
ever, (a , 81) is jointly dominated by (a2, 82), so the latter is the solution in 
the strict sense. The prisoner’s dilemma has no jointly admissible 
equilibrium pair, hence it is not solvable in the strict sense, but it is 
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Let X and Y be arbitrar eae 
ers 1 and 2, respectively. ae 
ll restrict his attention to 4 and that 2 will 
restrict his attention to Y, then it is actually possible that 1 need onl 
attend to a subset X* of X. This would be so if every strategy which is in 
X but which is not in X* were dominated by some mixed strategy in X*, 
Clearly, we would want to consider the smallest such set. Thus, we are 
led to the following definition: Given X and Y, the subset X* of X is said 
to be a minimal complete class of strategies of X relative to Y if: 


i. For any x in X but not in X* there is at least one x* in X* such that 
Mj(x*, y) 2 M,(x, y) for all y in Y, and greater than holds for at least 
one y in Y. 

ii. No proper subset of Y* has property i. 
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where X is the minimal complete class for X¥) relative to Y‘"”” ¢ 


In like manner, we may define sets X‘*’ 


Y™ is a minimal complete class for ¥‘"~» relative to X¥%~. As far as 1 
is concerned, a reduction from X¥~” to X¥™ is only safe if he feels confi- 
dent that 2 will confine himself to Y—); if 2 does not and 1 keeps on 
reducing the set of strategies he will consider, he may be asking for 
trouble. 

If 1 and 2 both keep reducing their strategy spaces, the process must 
eventually terminate in the sense that there is an integer N such that 
X™ = X"+) and YS) = YAY, XM) and Y™) are called the 
completely reduced strategy spaces and (X,Y): My, Me) the completely 
reduced game associated with (X, ¥; Mi, M2). A non-cooperative game is 
said to be solvable in the complete weak sense if the associated completely i 
reduced game is solvable in the strict sense. | 

Any pair of strategies in the completely reduced strategy spaces of the 
n-fold iterate of a prisoner’s-dilemma-type game reduces to the choices of 


a and B» from the first move on, so it is solvable in the complete weak 
sense. 


5.10 sOME PSYCHOLOGICAL FEATURES 


Although we have considered many special definitions of what consti- 
tutes a “solution” of a non-cooperative game, the above analysis is 
pitifully incomplete. One might be tempted to argue that, in the absence 
of any adequate theory, the players should, as a last resort, choose the 
Maximizer of their security level; however, it can be shown that the 
resulting maximin value (optimal security level) never exceeds that of any 
equilibrium pair. Thus, even from a very conservative point of view, the 
equilibrium pairs are worthy of a great deal of consideration. 

Within the realm of equilibrium pairs, one might hope to extend the 
domain of analysis further by introducing a more subtle partial ordering 
of the equilibrium points by taking into consideration psychological fac- 
‘ors. For example, in the game 
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In this analysis it is not only the qualitative ordering of the numbers which 
n . at 
counts; the quantitative aspects are extremely significant. 


In the game | 
(4, —3000) (10, aM 
(12, 8) (5, 4) 


the equilibrium pair (a1, 82) is jointly dominated by (a2, 8) since the 
former yields a return of (10, 6) versus (12, 8) for the latter. Thus, the 
latter point is a solution in the strict sense. Yet if we were player 1 we 
would hesitate to use a2 on the grounds that player 2 would argue that 6) 


Psychologically dominates 8), and so long as 2 can give any rationale for 
1’s choosing a, 2 does not dare choose Bi. 
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is simple because qj strictly dominates a2, and Pp; strictly dominates B2. 
Furthermore, the pair (a1, 61) is the unique equilibrium point which is 
jointly admissible, so it is the solution (in any sense) of this non-coopera- 
tive game. Now suppose the players were forced into preplay communi- 
cation. Player 1 can demand that they enter into a binding agreement 
to choose (a1, 82) by the threat to choose ae if 2 does not agree. To be 
sure, 1 does not want to take a2, which would give him only 0, but if he 
does 2 is faced with a loss of 200 (which cannot be said to give 1 any 
satisfaction beyond 0 since we are already dealing with utilities). It is 
reasonable to suppose 2 will succumb to the “threat” if the same numbers 
for players 1 and 2 somehow denote changes of comparable importance. 
Regardless of some of the potential pitfalls of the above analysis, it is to 
2’s advantage to refuse to come to a conference table, for to confer would 
only allow 1 to browbeat him into an agreement. 

If this non-cooperative game is iterated, then it might as well be coopera- 
tive, for 1 can force his desires on 2 by taking a2 a few times until 2 learns 
the “‘score.’? This is a vivid example of “‘collusion through iteration.”’ 

In this example we have some preview of coming attractions in the 
cooperative game: the threat powers of players and their attendant 
interpersonal comparisons implied in such phrases as “‘this will hurt you 
more than it does me.” 


5.12 SUMMARY 


The analysis of non-strictly competitive, i.e., non-zero-sum games, is 
inherently different from that of strictly competitive ones. In the zero- 
sum case, it is never advantageous to disclose one’s strategy, equilibrium 
Pairs are equivalent, equilibrium strategies are interchangeable, and 
mMaximin and minimax strategies are in equilibrium. An example served 
to show that all these assertions are false, in general, for non-zero-sum 
Sames. Further examples exhibited other pathologies. In the prisoner’s 
dilemma, each player has one undominated pure strategy, but the payoff 
to that pair, even though it is in “equilibrium,” is not jointly desirable. 
Thus, although a “‘rational” player cannot do better than play his undomi- 
nated strategy (assuming a single-shot game without preplay communica- 
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should one player succumb to the temptation of a short range gain, the 
other can resort to punitive action by also defecting to his undominated 
strategy for the next few trials. ‘This same game, when repeated a known 
fixed number of times, can be analyzed as a supergame from an equilib. 
rium point of view. The unique overall equilibrium behavior demands 
that each player employ his undominated strategy on each trial, which 
seems contrary to ordinary wisdom. Though we conclude from such 
examples that the equilibrium notion is not universally applicable to 
non-zero-sum games, it still remains an important analytic tool for a wide 
class of games. In addition, even if it is rejected as the basic tool fora 
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to realistic threats without, at the sam 


say, there may well be strategic consideratio 
these we shall ignore, and we shall assume that negotiation 1s compu’ ory 
Most authors feel that, if such economic problems as duopoly, Jabor- 


management disputes, trade regulations between two countries, etc., can 
be treated as games at all, then it will have to be in the cooperative con- 
text. In like manner, one may hope that it will prove possible to formu- 
late cooperative game models which reflect limited aspects of the diplo- 
matic relations between two countries or of the political conflict between 
two parties within a single country. To be sure, such small parcels of a 
complex social or economic problem can be realistic only to the extent 
that the utility functions chosen do reflect the subtle interrelations between 
the game-in-the-small and the overall problem. Given the present state 
of game theory, we are indeed skeptical that many such problems can be 
given a realistic formal analysis; rather, we would contend that a case can 
be made for studying simplified models which are suggested by and related 
to the problem of interest. The hope is that, by analogy, their analysis 
will shed light—however dim and unreliable—on the strategic and com- 
munication aspects of the real problem. 


6.2 THE VON NEUMANN-MORGENSTERN SOLUTION 


Consider again (see section 5.3) the payoff matrix 
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It will be recalled that the set of possible payoffs can be given in a drawing 
as shown in Fig. 1. To any point in the shaded area R there is a pair of 
mixed strategies (x, y) such that the payoffs [M1(x, y), Mo(x, y)] are the 
coordinates of that point; and, conversely, to every pair of mixed strate- 
gies, the corresponding pair of payoffs constitutes a point in the shaded 
area. 

Recall that if this game is repeated in time, it is reasonable for the 
players to alternate, in phase, between their first and second strategies. 
This yields (2, 1) and (1, 2) as alternate payoffs, and the average payoff 
Per trial is (34, 3g). This expected payoff cannot be achieved in a single 
trial if the players randomize without any preplay communication; how- 
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pendent random Strategies. It is, of course, possible if the players act cooperative) 
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if whenever two points lie in the set the line segment joining them also lies in the 
set. Thus, for example, all the points interior to a circle comprise a convex set— 
but not a convex body. A convex body is a convex set which also contains its 
boundary! and which is bounded in the sense that there is a circle about the 
origin which includes the whole set if the radius is chosen to be sufficiently large. 
Another way to describe R’ is as the “‘convexification”’ of R: the least addition of 
points to R which results in a convex body. < 


Suppose that the region R’ of some game is of the form shown in Fig. 3. 
By acting jointly, the players can achieve any point of R’ as their payoff. 
A point (u, v) of R’ is said to be jointly dominated by a different point (u’, v’) 
of R’ if both u’ 2 uw and v’ 2 v. Clearly, the players need not consider 
any point which is jointly dominated by another one of R’. Thus, after 
a little preliminary negotiation, they can be expected, if they are rational, 
to confine their attention to the jointly undominated outcomes, which in 


‘Formal mathematical definitions of such words as “boundary,” “interior,” and 
“exterior” can be given, but it is hardly appropriate to do so here. 
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6.3 SOLUTIONS—IN WHAT SENSI 
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Although von Neumann and 
restrictions can be placed upon 


several other authors have attempted to sing! 


9 which they put forward as a “realistic” solution to the ; “mM 
For example, in section 6.9 we shall discuss a scheme due to Nash which 
formalizes in a precise, but particular, fashion the preplay c ymmunication 
and bargaining. The resulting extensive game is treated as a non-cooper- 


ative game without preplay communication, and one of its equilibrium 
points which has a special mathematical property is taken to be the solu- 
tion to the cooperative game. This will be preceded by a discussion of 
Nash’s bargaining model (section 6.5) which is a stepping stone for his 
treatment of the general cooperative game. 

In what sense can any such unique point be considered a solution? 


For example, in the game 


there are many who would object to choosing (32, 34) as the cooperative 
solution. To be sure, they would agree that the symmetry of the matrix 
affords some argument for choosing the symmetric outcome in 0, but they 
would question whether a possible asymmetry of the roles of the players 
has not been overlooked. For example, player 1 may refer to a whole 
labor union and 2 to a single entrepreneur; or 1 may be the government 
serving the interests of the nation, whereas 2 isa monopolist. But if such 
role asymmetry exists, then surely “solution” is not meant in an ethical 
sense—as a concept of a just outcome. Furthermore, it surely cannot be 
meant in a descriptive sense, since the players may also be asymmetric in 
bargaining ability, one being a tough bargainer and the other indifferent. 
So, in what sense is it a solution? 

Indeed, one can argue that the whole of Xt is probably not suitable as a 
solution in a descriptive sense. Consider the game 


(0, 100) (100, 0) 
(—1, —200) (—40, —300) |. 


Let us suppose that the payoffs are in dollars and that the two players are 
in roughly equivalent financial positions. In the non-cooperative game 
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the bargaining personalities of the two players will begin to enter: Each 
can hurt the other in differing degrees and so they will dicker, threaten 
withdraw, bluff, etc., until an outcome is reached. 

To predict what will in fact happen without first having a Complete 
psychological and economic analysis of the players seems foolish indeed 
We would claim no descriptive limits: anything is liable to happen— 
including the (—40, —300) payoff! Specifically, we would not claim that 
an actual observed outcome will lie in the negotiation set. 
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6.4 ARBITRATION SCHEMI 
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Let us suppose that the | 
restricted their attention to th 
bargaining over which point 
more he will probably get 
threat! This is the rub about coo} 
in such situations so-called “rational” peopi ; countries) frequently 


have failed to reach an agreement, and the threats have ! d to be carried 
out, to their mutual discomfort. For these reasons players are often 
willing to submit their conflict to an arbiter, an impartial outsider who 


will resolve the conflict by suggesting a solution. 

We may suppose that the arbiter sincerely envisages his mission to be 
“fairness” to both players; however, there are not, as yet, any simple and 
obvious criteria of ‘‘fairness,” so, in effect, he is being asked to express a 
part of his ethical standards when resolving the game. The arbiter can 
be assumed to want to suggest a solution which will seem ‘‘reasonable,”’ 
both because he is sincere and because he may wish to be hired for such 
tasks in the future. Thus, for example, he would be mistaken to suggest 
a solution having an obvious alternative which is preferred by both 
players. Or suppose there are two different conflict situations and that 
everyone agrees player 1 is strategically better off in the first than the 
second; then the arbiter should not give player 1 less in the first than the 
second. Inshort, an arbiter will (or should) try to satisfy some consistency 
requirements. In addition, as with most adjudicators, he will be anxious 
to defend his suggested solutions with some fairly good rationalization. 
All of this means that he should be prepared to formulate and to defend 
the basic principles which lie behind his suggested compromises—they 
should not be completely arbitrary! 

Whatever principles may be involved, the net effect is to associate to 
each possible conflict of interest a single outcome. Thus, we define an 
arbitration scheme to be a function, i.e., rule, which associates to each con- 
flict, i.e., two-person non-strictly competitive game, a unique payoff to the 
players. This payoff is interpreted as the arbitrated or compromised 
solution of the game. Without further specification, there are clearly an 
infinity of such functions. We could try to select one on an intuitive basis 
and attempt to defend it; however, we should always fear that someone 
might concoct a hypothetical situation for which the arbitrated solution is 
at variance with our intuitive “ethical norms.” Rather than dream up a 
multitude of arbitration schemes and determine whether or not each 
withstands the test of plausibility in a host of special cases, let us invert 
the procedure. Let us examine our subjective intuition of “fairness” and 
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which are acceptable. The onlv alternative—to examine in detail each 


of the infinity of schemes for each of the infinity of possible conflicts it is 
supposed to arbitrate is not practical. 

What are some reasonable principles for an arbitration function to 
fulfill? At this time we do not wish to go into great detail, and so the 
reader should be tolerant of the following formulations; they are stated in 
a very rough manner—ambiguities exist and misinterpretations may be 


anticipated. However, such ambiguities will be eliminated once we make 
the conditions mathematically precise. 


‘ The arbitrated solution of a specific conflict situation (two-person 
non-strictly competitive game) should be an element of the negotiation 
set of the game. In other words, the arbitrated solution should give each 
player as much as he could be expected to gain if he played non-coopera- 
tively, and there should not be any other feasible payoff preferred by both 
players to the solution. 

ii. The arbitrated solution, as seen in terms of the real underlying con- 
flict, should not depend upon the particular utility units used in abstract- 
ing the problem into a formal framework. 

iii. The arbitration scheme should be egalitarian in the sense that it is 
independent of the names or labels of the individuals in the conflict. 

iv. If two games are “close to each other” in some strategic sense, the 
arbitrated solutions should also be close. Put another way, slight per- 
turbations or errors of measurement should not drastically alter the 
arbitrated solution. 

yv. The arbitrated solution should reflect the threat capabilities of the 
players in the conflict situation. (This condition, in particular, needs 
considerable clarification.) 


It is all too easy to say “‘Ah yes, these are very reasonable conditions,” 
so let us append a word of caution. It is often difficult to assess iow, 
reasonable an axiom actually is in its abstract setting; we must seek its 
meaning in concrete contexts, looking particularly for cases where it leads 
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its boundary). In Fig. 4 a typical R is shown. The region will be 
polygonal, as shown, if the underlying set of trades is finite, ie., if no 
commodities are infinitely divisible; otherwise it need not be. 

The choice of utility functions is not unique, as has been stressed, but, 
if we had made other choices having different units of scale and different 
origins, the new region would be merely a translation and stretching of 
the old one. Thus, the same underlying bargaining problem has an 
) infinity of simply related abstract versions. 

In summary, then, a bargaining problem is characterized by a region 

| R of the plane and a special point (u*, v*) of R. We denote such a 
bargain by [R, (u*, v*)]. The following interpretation holds: If no trade 

occurs, the payoff is (u*, v*), and a trade takes place if and only if both 

players agree upon a unique point of R which then constitutes the payoff. 
Naturally, 1 desires a trade represented by a point as far to the right as 

possible in R, and 2 wishes to obtain a point as high in R as possible. In 

general, of course, these are incompatible desires; nonetheless, it behooves 
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Assumption 2 (Pareto optimality) 


have the following properties: 
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In words, the arbitrated value must | 
feasible, and (212) not bettered by any other feasible poini 
Assumption 3 (independenc e of irrelevant alternatives). Suppose hat 
two different bargaining games have the same status 
possibilities of the one are all included in the other. 
game with the larger set of alternatives is actually a feasible trade in the game with 
the smaller set, then it shall also be the arbitrated value of the latter game. Put 
another way, if certain new feasible trades are added to a bargaining problem tin 
such a manner that the status quo remains unchanged, either the arbitrated solu- 
tion is also unchanged or it becomes one of the new trades. In symbols, suppose 
[Ri, (u*, v*)] and [Ro, (u*, v*)] are the two games and that 


Guo point and that tile i ading 
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[f the arbitrated value of the 


(i) Ry ts a subset of Ra, 
(ii) F[Re, (u*, v*)] ts in Ri, 
then 
FLRi, (u*, v*)] = F[Re, (u*, v*)]. 

Assumption 4 (symmetry). Suppose the version [R, (u*, v*)] of a bargain- 
ing game has the following properties: 
2 = 2" 
(ii) Tf (u, v) ts in R, then (2, u) is in R, 
(ili) (uo, 20) = F[R, (u*, 0*)], 


then 
uo = Vo. 


In words, if an abstract version of a bargaining game places the players in com- 
pletely symmetric roles, the arbitrated value shall yield them equal utility payoffs 
where utility is measured in the units which made the game symmetric. : 
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satisfies ites four assumptions, it is the only function which does so. Put sone 
Way, these desiderata implicitly define i eas 

th a unique arbitr 
os. q ation scheme for 


The proof is sim i ine i m 
: ple, so we will outline it for th i i 
: € mathematical] 
onsider a version [R, (u*, v*)] of a bargaining game, and let [R’ Hi ' 
5) 5) Cc 


ive Games 


t oat the orl 

f es atus qu : 
A which puts th i >) ‘s : | 

: : a 4 f af maps into 

| A hat (uo 9 v0 ) I @ 

: ie We now show that 


r+ y*) is the soluti¢ 
v0 ) ‘? 
ch contains R", and s 
ding to assu 
4. Accordins 
1 andy 2 


: it iS also 
at ¥ mption 3 it 1s a 
(x, y) ie ie" (0, 0)]; hence #4 ae a ihe problem. hat it is ' 
tion ’ : solu 4 ae. ae ean 
rhs the Nash a Poe F’ is a function distin: Lich is also 
: or 


(0, 0)], from a (uo! + #'> 
R (0, 0)} and | Rit whi 
bets a symmetric st 
ex 


ique solution 18 287» © F at some version [R, (u*, z he above cop, 
uniqu If it disagrees with Se ution ex’ Naih’s ane 
solution. 1 dicted. This shows that il a xioms 


struction is contra 


fi ] . factitd Peicerim sil 
at in fact it Goes AIST wiem q 
itm 


6.6 CRITICISMS OF NASH'S MODEL OF THE BARGAINING PROBLEy 
A number of criticisms can and have been made of Nash’s model; some 
that we shall examine do not appear to us to be relevant, but others seein 
more serious. Some confusion has resulted from Nash’s presentation of 
the problem. He has always confined himself to formal versions of the 
bargaining situation in which the status quo trade is at the origin, or, in 
other words, the zeros of the individual utility functions were arbitrarily 
chosen tobe the case ofnotrade. “In a bargaining situation one anticipa- 
tion is especially distinguished; this is the anticipation of no cooperation 
ee eet It is natural, therefore, to use utility fone 
uals which assign the number zero to this anticipation. 
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aa leaves each individual’s utility function determined only up 
multiplication by a Positi 


Caused confusj ve real number.”? [1950 b, p. 157.] This has 
Beaite lov ie for many readers have falsely interpreted this to mea”? 
unnatural but eo Y. They have argued that not only is this choict 
of utility, 1 has . eety establishes an interpersonal compat!s?” 
i : en aR hs ; : wise 
ngenious argument eld that it is a serious flaw in an other! 
Nash’s di : LD Our Presentati d from 
cussion so g ; mation we purposely depart 


S to . ions: 
c second Soup of criti Sive a simple refutation of these contentio™ 
San €rtainly, it Hicisms surround the meaning of this «solution 
argaj ; ’ Is not a ek s jn 
ens 
reach ae asy to cit Prediction of what actually hapP¢ 


© empiri no 
Plrical cases where an agreement is 


I UD a : ten $ 
uw 


ts ee : IVisi . 
Solution shoupa ctancies™ ae which purportedly should r¢ 0 
Salners, +, “Onsist ee onal bargainers.”” “Now, since” 


Ist 
> MNese Tatio 
Me &xpectat: mal ex _ . a6 
nt between thee lations shoy] d fe tations of gain by the t pe 
tWo. 


ence th realizable by an appropriate °° od 
“Te should be an available antic! 


— 


6.6] Criticisms of Nash’s 


which gives each the amount « 
reasonable to assume that 
that anticipation or to an « 


point in the set of the gra 


senting all anticipations t 
[1950 }, p. 158.] It is not « 
other than to say that assumpti 
1 and 4 stipulate the principle of 
“fairness”; nonetheless, the spirit ie 
of the argument is clear. es! oN 
If this is accepted, then most 
remaining criticism takes the form 
of examples where it is contended 
that the Nash solution is not fair. 
We feel that often these criticisms 
are not just to the arbitration 
scheme, since the critics seem to be 
demanding that a “fair” arbitra- 
tion scheme yield a “‘fair”’ solution 
when applied to an “unfair” situa- 
tion. Consider the game where 
two players are to divide $100 
between them if they can reach an agreement, or where they will receive 
nothing if they do not agree. The region of possible outcomes is shown in 
Fig. 5. If they can agree upon a point in this region, bargainer 1 will 
receive the value of the first coordinate and bargainer 2 the value of the 
second one; if not, they each get zero. If utility is assumed to be linear 
with money, then the Nash solution is (50,50). However, suppose there is 
an asymmetry in the economic roles of the players. For example, suppose 
1 is rich and selfish and 2 is poor. Player 1 may then make a good case 
for (75, 25), let us say, on the grounds that the utility increment to 2 of 
$25 is at least as great as the gain to 1 of $75. He would argue that an 
ethically fair division of the proceeds is a fair division of utility, not of 
money. For example, suppose that (part of) the utility functions were 
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Player 1's utility 
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these: 
Monetary Payoff Utility 
1 2 1 2 Product of Utilities 
$100 0.00 1.00 0.000 
75 0525 0.98 0.245 
50 0.50 0.90 0.450 
25 0.75 0738 0.548 
0 1.00 0.00 0.000 
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Nash solution to the bargaining problem. : 
Note well how the asymmetry in the roles of the players enters into the 


solution. The model does not equate the status quo positions of the 
players, but the asymmetry of economic roles is partially reflected in the 
shape of their utility functions. Ethically, this may be unfortunate, since 
the economic asymmetry works to the detriment of the poor man and to 
the benefit of the rich one. This has been cited as an example to show 
that the Nash axioms are not “ethically fair.” We must point out, how- 
ever, that Nash’s solution only purports to give a “fair” arbitrated value 
er te Sie aspects are taken into account. These are 
eis “pociet it Baovens, and, had the utilities of the rich man 
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Suppose, first, that the players are in roughly equi\ alent economic 

ositions, then player 1 can make a good case for the point on the line 
joining (10, 0) to (0, 100) in whic h both players get the same rewards, 
approximately (9.09, 9.09). His argument can Db j 

based on two grounds: First, the threat that, if 2 hold 100 

out for ($5, 50), then 1 will not agree toit. This he can 

afford to do since he will lose much less than 2 will. 
Second, the ethical argument that the reference point 
really should be (0, 0) and that each should be made to 
gain equally. Player 2 would surely argue that if they 
move from (5, 50) to (9.09, 9.09), then he will have given 
up $40.91 and 1 will have only added $4.09. Again, 1 
would question why (5, 50) is the reference point rather 
than (0, 0), claiming that the asymmetry of the problem 
is far more apparent than real. ‘For instance,” he 
might argue, “‘suppose that there were two games under irate 
consideration: the one we are playing, which I shall call ete 

A, and another, B, which has the payoff region shown Player 1's utility 
in Fig. 7. In B it is reasonable that we should each get 
$5. This is how much you say that I should receive from 
game A; however, I personally would much prefer to play in A than in B. 
That I do presumably means only that I expect to get more out of A than 
out of B, and not that it gives you a much better return with no benefit to 
me, Iam not reluctant to let you have more in A than in B so long as I 
get something out of it. You 
should not get false aspirations 
because of the asymmetry of the 
region of payoffs in A—after all, I 
control the outcome as well as you 
and (0, 0) looks awfully symmetric 
to me!” 

Such an argument implicitly as- 
sumes the existence of an inter- 
personal comparison of utility. 
Nash feels that such comparisons 
are not meaningful and that such 
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Player 2’s utility 


0 an argument cannot, therefore, be 
Player 1’s utility made sound. We would agree that 
Fic. 7 interpersonal comparisons have not 


been given a rigorous meaning, but 
we feel that an abstraction completely omitting such considerations is 
perhaps departing too far from reality in certain contexts. In many bargain- 
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payoffs are in money, that utili 
if they do not agree upon 
thing. In each game the 
whether (5, 50) is a reasonable 
it should be for game B 7/ it is f 
of the independence of irrelevant alternat 

In judging the plausibility of a potenti } ; 
to evaluate the “levels of aspiration” of the players, for that is certainly one 
of the psychological factors often 
involved in bargaining tempera- 
ments. The critics of the third 
assumption argue that whatever a 
“fair” solution to game A may be, 
a “fair’’ solution of game B should 
yield less to 2 than the solution of 
game A, for in game A his poten- 
tialities are far greater. Put 
another way, let us suppose the 
players have agreed upon a solution 
to game B, and then they are told 
that actually the game is 4; it is 
reasonable for player 2 to argue 


Player 2's utility 
Player 2's utility 


10 ~ 10 
that he aad deserves more. If so, Player 1’s utility Player 1’s utility 
assumption 3 is violated. eee Game B 

We feel at this time—the implica- ie: 
IG. 


tion being that we have changed our 
minds in the past—that this argument against assuming independence of 
irrelevant alternatives loses its appeal when applied to bargaining prob- 
lems; the reason is that the naturally distinguished trade, the status quo, 
serves to point out that certain aspirations are merely empty dreams. 

A word of caution about this assumption: although it may itself be 
reasonable, there are numerous related assumptions which appear to be 
equally plausible at first glance but which are not. For example, con- 
sider this one: If new trades are admitted in a bargaining game and the 
Status quo point is held fixed, this should either not affect the solution or 
the new solution should be one of the new trades. Furthermore, if there 
are new trades which both players prefer to the old solution, the solution 
to the new game should be one of these preferred new trades. This 
assumption, while apparently reasonable, is contradictory if Pareto 
1 Gomes is assumed. We show this by considering three games—A, B, 

nd C— . : eNae ts 
The aaa associated regions R 4 Ro, and Rg are shown in Fig. 9. 
4 is simply the straight line from the origin to the point a, 
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Symmetry, assumption 4, is Open 
to major ethical criticisms in 
certain contexts. For example, 
player 1 might be a single indi. 
vidual and player 2 a whole com. 
munity, then whether or not we 
wish to assume symmetry—aay, 
equality before the law—very 
much depends upon the context, 
Nash asserted [1950 4] that this 
Player 1's utility assumption ‘‘expresses equality of 

a 9 bargaining skill,” but later [1953] 
he disavows this interpretation. 
» for otherwise one has the impres- 
layers of “equal bargaining skill” 
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6.7 ALTERNATIVE APPROACHES 


Harsanyi [1956] notes that Zeuthen’s [1930] soh 
problem is mathematically equivalent ; 
solution, and he feels that Zeuthen’s formulation has the ad 


“supplying a plausible psychological model for the actual bargaining 
process.” In examining this formulation we shall continue to use our 
former terminology, but the spirit of the Harsanyi-Zeuthen remarks will 


be conveyed. 

Let a bargain be given with the region R of possible payoffs and the 
status quo at the origin. Suppose that player 1 is holding out for a trade 
with utility payoffs (w1’, uo’) and 2 is demanding (u,’’, ue’’), where the 
two points are different and each is Pareto optimal. Who should make a 
concession? ‘The argument is, and we shall examine it in detail later, 
that player 1 ‘“‘should’’ make a concession if and only if 


uy’ s. uy” 2 in” aA ‘ust 
= 


, 
uj u2 


and player 2 should make it if the inequality is reversed. It is easy to see 
that this inequality is equivalent to 


uy'us! -< uy! us!" 


Concession need not necessarily mean accepting the opponent’s demand; 
rather, the conceding player can suggest an alternative trade which will 
not require him to make a further concession in the next round of negotia- 
tions. But, for this to be so, he must propose some (w{’’, u3/’) having a 
component product w//’us"’ at least as large as the component product of 
his opponent’s demand, and larger if possible. Clearly, this procedure 
raises the component product at each stage, and so it inexorably leads to 
the point for which the component product is a maximum—Nash’s 
solution. 

As presented, the concession principle is totally arbitrary, but Harsanyi 
and Zeuthen have attempted to provide some rational underpinning for 
it. When the two demands are (uy’, uo’) and (u;”’, ue’’), then very crudely 
(ur — uy!")/uy’ and (u2’’ — uy')/u2’’ measure, respectively, the relative 
losses incurred when players 1 and 2 concede. The assumption, then, is 
that the player whose relative loss is the smaller will concede. 

A further derivation of the concession principle is presented by Harsanyi 
which is based on postulated human behavior. He assumes, among 
other things, that each player knows his opponent’s subjective proba- 
bility of conceding, in which case it becomes meaningful to consider the 
©xpected) utility of conceding or not conceding. This assumption, plus 
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joining that point and (uy", u2’’). This ation leads, in gene 
non-linear motion from the stat lone what ma 


“negotiation curve’ to the arbit 


Both of these schemes satisfy all o! ; axioms save the independence 
of irrelevant alternatives, and depending upon one’s point of view this may 
or may not be a fault. 

It should be pointed out in conclusion that, in the continuous motion 


model, the slopes of the negotiation curve and of the Pareto optimal curve 
are of the same magnitude at their point of intersection, but of opposite 
sign. If one “linearizes” this model by demanding that the negotiation 
curve be a straight line having this same relation between its slope and that 
of the Pareto optimal curve at their point of intersection, then the arbi- 
trated point is Nash’s point where the product is a maximum. 


6.8 ARBITRATION SCHEMES FOR NON-STRICTLY COMPETITIVE 
GAMES: THE SHAPLEY VALUE 


Having digressed into the philosophy of arbitration and discussed 
simple bargaining games, we return in this and the following two sections 
to the general case of non-strictly competitive games. It will be recalled 
that we denote by R the set of all possible pairs of payoffs when the two 
players use uncorrelated mixed strategies and by R’ all possible pairs of 
payoffs when the players cooperate and adopt joint randomized strategies. 
Every point in R is in R’, but the converse is not necessarily true. By 
(v1, v2) we denote the payoffs representing the security levels of the players, 
i.c., player 1 can guarantee himself an expected return of v, by a suitable 
strategy choice, and v2 is similarly defined by changing the roles of 1 and 
2. The negotiation set 9 is the part of the northeast boundary of R’ 
which dominates (v1, v2). ‘The problem is to single out a point of the 
negotiation set of each game as its “‘solution.” 

One simple way to do it is to treat the cooperative game as the bar- 
gaining game [R’, (v1, v2)]. That is, a given non-strictly competitive 
game induces a bargaining game where the status quo is taken to be the 
security level of the players, and the solution of the game is taken to be the 
Nash solution of the bargain associated with the game. This procedure 
has some nice properties: it picks out a unique point of the negotiation set, 
it is invariant with respect to the origin and the scale of utility measure- 
ment, and it is symmetric or egalitarian in that it does not look at the 
labels of the players. 

We may call this the Shapley procedure on the grounds that it is a slight 
generalization of a very special case of the Shapley value of an n-person 
game. The latter notion will not be discussed until section 11.4, but it 
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dure for solving two-person cooperative gam Let the game have only 


two pure strategies for each player and let the payoff matrix be: 


Qa] (1, 4) ( | 

a9 S 1) t } | 
The bargaining region R’ is that shown in Fig. 12, and th security levels 
are (0, 0). The maximin strategies are (44a), Yay) and (1461, 1482). 
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Since the induced bargaining situation for this game, [R’, (0, 0)], is sym- 
metric, it is easily seen that the Nash solution is (54, 34), which is the mid- 
Point of the negotiation set, i.e., the midpoint of the line segment from 
(1, 4) to (4, 1). 
Observe that this induced bargaining game, and the solution, are per- 
fectly symmetric between the two players, but that the game itself is not 
—Player 2 has a distinct advantage. For suppose 2 threatens to play 
. Strategy 61, then what alternatives has 1? If he plays a, then the payoff 
B (1, 4)—the best possible for 2 in the negotiation set of the game. There- 
°re 1’s only realistic counter threat is a2, leading to the payoff (—4, — i). 
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6.9 ARBITRATION SCHEMES FOR NON-STRICTLY COMPETITIVE 
GAMES: NASH’S EXTENDED BARGAINING MODEL 


By selecting a different status quo point than in the Shapley procedure, 
Nash [1953] has extended his analysis of the bargaining model in much 
the same way to yield an analysis of all two-person non-strictly competi- 
tive games. Roughly, his idea boils down to this: Each player adopts a 


mixed strategy as a “threat”; the Pair of threats establishes a payoff, 
which, in turn, acts as the status i 
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non-cooperative game of which it is ; quilil m point and 1XiO1 
tizing the solution of a general cooper 
of these. 

The first procedure reflects Nash’s belie! he non-cooperative 
games are more basic than cooperative 
5 


cooperative game to a non-cooperative game having as fort 
defined moves any negotiation and b ining 
In this case, he proposes that it be done in the following manner: 


S 


Move 1: Player 1 chooses a mixed strategy x. 

Move 2: Player 2, with no information about the choice made at move 
1, chooses a mixed strategy y. 

Move 3: Player 1, knowing the choices at moves 1 and 2, makes a 
demand d,, i.e., chooses a number d}. 

Move 4: Player 2, knowing the choices at moves 1 and 2, but not at 
move 3, chooses a number dp. 


Moves 1 and 2 and moves 3 and 4 can be thought of as pairs of simul- 
taneous moves. Any play of this extensive game can be described as a 
4-tuple (x, y, di, d2). The payoff associated with such a play is defined 
as follows: 


i. If (di, do) is a point of R’, i.e., if it is a feasible payoff in the game G, 
then players 1 and 2 receive d, and dg, respectively. 

ii. If (d, dz) is not a point of R’, then the payoff is [M41(x, y), Mo(x, y)], 
where M, and Mz are the payoff functions in the game G. 


A strategy for player 1 in this extensive game is a pair (x, d;), where d, 
may depend upon x and y; for player 2, a strategy is a pair (y, d2), where 
dy may depend upon x and y. Nash asserts that optimal strategies for 
the players are (x*, v,*) and (y*, v2*), where these quantities are defined 
above. 

It is true that these strategies are in equilibrium and that they yield a 
Pareto optimal payoff; however, as Nash is well aware, there is in general 
a4 Continuum of other inequivalent equilibrium pairs. The weak link in 
the argument is to single out this particular pair. Nash offers an ingeni- 
ous and mathematically sound argument for doing so, but we fail to see 
why it is relevant. 


Thus the equilibrium points do not lead us immediately to a solution of the 
8ame. But if we discriminate between them by studying their relative stabilities 
Wwe can escape from this troublesome non-uniqueness. 

To do this we “smooth” the game to obtain a continuous payoff function and 
ana study the limiting behavior of the equilibrium points of the smoothed game as 

© amount of smoothing approaches zero. [1953, p. 131.] 
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consisting of seven axioms which are aad . ape of the 
negotiation model. As could be expected, ee Oo oe 
“appropriate modification” of the principles used in the bargaining 
model, namely: (1) feasibility, i.e., the solution should be in R’; (2) 
Pareto optimality; (3) invariance with respect to utility scales; (4) sym- 
metry, i.€., independence of labels of players; and (5) independence of 
irrelevant alternatives. The remaining two axioms describe the behavior 
of the “solution” when the domains of available strategies are modified, 
but the payoffs remain fixed. Axiom 6 requires, roughly, that, if a 
player's choice of strategies is restricted while at the same time the other 
og ae is used and the payoffs are held fixed, then 

€ solution cannot Increase. ‘“‘A player’s position in the 
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employing his “‘threat’”’ and “‘demand”’ y 
tion model and the axiomatic approach are quite similar in spirit and that 
they serve to complement each other very well. 


6.10 ARBITRATION SCHEMES FOR NON-STRICTLY COMPETITIVE 
GAMES: THE CASE OF MEANINGFUL INTERPERSONAL 
COMPARISONS OF UTILITY 


In the preceding sections we have been concerned with arbitration in 
situations where interpersonal comparisons of utility are assumed to be 
meaningless; in this section we shall suppose that they can be given mean- 
ing. Raiffa [1953] considered both cases in his work. For the case 
where they are meaningless, he suggested a class of procedures out of 
which he'singled one for special attention. Even though the two authors 
worked independently and devised quite different rationales for the 
procedure, Raiffa’s special scheme is operationally identical to Nash’s 
extended bargaining model (section 6.9). Of these two rationalizations, 
Nash’s is the less ad hoc. In the context of meaningful interpersonal com- 
parisons, Raiffa has offered an arbitration scheme which deals directly 
with the cooperative game and which is independent of Nash’s solution of 
the bargaining problem. ‘This we shall now discuss. 

Depending upon the situation, we may or may not wish to permit side 
payments, which simply means we choose to deal with different regions 
of payoff pairs; the analysis is the same in both cases. For ease of discus- 
sion, let us suppose that the individual security levels are both zero, and 
so the negotiation set is the northeast boundary of R’ from a to 8, as 
shown in Fig. 13. If we take any point (wi, w2) of R’, then the relative 
advantage to player 1 is uy — ue. All points of R’ having the same rela- 
tive advantage obviously lie on the 45-degree line passing through 
(u1, ua), so the contour lines of constant relative advantage are all the 
45-degree lines. 

Suppose, for the moment, that player 1 has a strategy x* such that, 
regardless of player 2’s choice, his payoff in the non-cooperative game is 
o ; ; : ; 3 
| eae the contour line passing through some point ¢ in the negotia- 

- Similarly, suppose 2 has a strategy y * such that, independent of 
1s choice, his payoff is on or above the same contour line. If so, then we 
submit that ¢ is a ‘‘reasonable”’ candidate for the arbitrated solution. 
oe is this: If 2, for example, wishes to move from c, then 1 can 

€n to use x* which will maintain or increase the relative advantage 
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it is resolved they cooperate fully to increase their payoffs as much as 
i Bis | 
possible while preserving the relative advantag 

The procedure may be illustrated for the gaz nati 
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studied in section 6.8. It is easy to compute that the induced zero-sum 


The value to 1 of this game is clearly —3. Since the point (1, 4) of the 
game yields a relative (dis)advantage of —3 to 1 and is also on the north- 
east boundary of the region (see Fig. 12), we concluded that it is the 
arbitrated solution. This, it will be recalled, was the solution we obtained 
before when we admitted that the game is not symmetric in the sense that 
(—4, —1) should be the status quo point, not (0, 0). 

Such a procedure does not depend upon the origins of the utility func- 
tions, but it is sensitive to changes in units of measurement. ‘Thus, when 
we speak of relative advantage, we are assuming that there is a common 
unit of measurement. For many situations money serves this purpose. 
In some contexts where money is not appropriate, it may be possible to 
determine a common unit by choosing two stimuli to serve as reference 
points for equating tastes. In still other cases, interpersonal comparisons 
may be deemed entirely inappropriate, in which case this procedure is 
useless. It would be desirable to have a method for establishing inter- 
personal comparisons within the framework of the game itself, equating 
Certain distinguished values of the game. Two ad hoc methods for doing 


so are discussed in the next section, and they are illustrated by a specific 
example. 


6.11 TWO DEFINITIONS OF INTERPERSONAL COMPARISONS 
IN TWO-PERSON GAMES 


To illustrate the definitions which will be presented, consider the fol- 
lowing amusing conflict situation introduced by R. B. Braithwaite in 
Theory of Games as a Tool for the Moral Philosopher [1955]: 


ere that Luke and Matthew are both bachelors, and occupy flats in a 


all which has been converted into two flats by an architect who had ignored 
4. ousiderations of acoustics. Suppose that Luke can hear everything louder 
* 4 conversation that takes place in Matthew’s flat, and vice versa; but that 
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/ The overall procedure, then, is to take the arbitrary payoff matrix, 
reduce it to canonical form by transforming the utility scales so that the 
| most preferred outcome has utility 1 and the least preferred 0, find the 
solution as given in section 6.10, and then convert the answer back to 
utiles in the original measurement scales. This procedure has two inter- 
esting features: First, it yields an arbitrated solution which is invariant with 
respect to origins and units of utility measurement—even though as a technical 
device a specific pair of scales were singled out as an integral part of 
) the analysis. Second, it satisfies all save one of the axioms demanded by 
Nash for “any reasonable value.” The exception is the independence of 
irrelevant alternatives. But for this reason, the procedure is open to 
serious criticism. For example, one can add irrelevant strategy alterna- 
tives, any pair of which will certainly not be adopted by the players, which 
alter the least favorable outcomes, and hence may alter the arbitrated 
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1 to at most 3.25. 
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arrangement is relatively advantageous to Matthew, and it is worth 
examining why this is so. Matthew’s advantage arises purely from the 
fact that Matthew, the trumpeter, prefers both of them playing at once 
to neither of them playing, whereas Luke, the pianist, prefers silence to 
Cacophony.” [1955, p. 37.] Matthew has the threat advantage. 
Braithwaite makes two other observations which seem, to him, to bolster 
his arbitrary normalization of the utility scales. First, the northeast 
boundary of the region R is actually a parabola whose axis makes a 
45-degree angle with the horizontal when his procedure is used to nor- 
malize the payoffs. In general, then, he chooses contours of constant 
relative advantage which are parallel to the axis of the parabola which 


forms the northeast boundary of R. Second, his choice of scales also 
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Nonetheless, we consider his procedure, in particular the rationalizations 
of his definition of the common unit of measurement, arbitrary. This js 
not necessarily to be interpreted as an unfavorable criticism of the “reason. 
ableness” of the procedure, but rather as a recognition that a clinching 
argument, showing how this method is better than others, has not yet 
been produced. With this, we are sure Braithwaite would agree. 
>We have considered two apparently quite different methods of arbitration: In 
sections 6.8 and 6.9 the analysis de 
rested upon a prior analysis of bargaining games. In section 6.10 and this one 
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6.12 STABILITY OF ARBITRATION SCHEMES 


When the bargaining model, or any other game theoretic mechanism f 
that matter, is applied to an empirical problem, the utilities used must be 
determined by experimental techniques. The; are, therefore, certainly 
going to be in error, and so it would be most unfortunate if small perturba- 


tions in the utilities could produce drastic changes in the arbitrated 
solution. In other words, we should demand of an arbitration scheme 
that the arbitrated solution be a continuous function of changes in the 
utilities. 

We shall say that an arbitration scheme is stable 
(or, equally well, that the arbitrated solution is 
continuous) if it possesses the following property. 
Let G™, where n = 1, 2, - + + , be a sequence of 
games, with the nth game having the joint payoffs 
(ap, b;}) and arbitrated values (v{”, »%”). Suppose 
that the sequence approaches the game G in the 
sense that the numbers a;} and b{” approach, 
respectively, the corresponding payoffs a;; and b;; 
of G as n approaches infinity. The scheme is stable 
if for each such convergent sequence it is also true 
that the numbers v{” and v) approach, respectively, the arbitrated values 
vy and vo of G. 

It is easy to give examples of unstable schemes. Consider bargains 
where the status quo is at the origin and where utilities are interpersonally 
comparable. Let the arbitrated value be that point on the negotiation 
set having the largest utility component, provided this point is unique; 
otherwise, let it be the point of the negotiation set which gives each bar- 
gainer the same utility. In the problem shown in Fig. 16, the arbitrated 
value is: 


o 


Player 2’s utility 


a 
Player 1's utility 
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(a, 0), ifa > 6, 
(0, 4), ifa <b, 
(a/2,a/2), ifa=b. 


Obviously, in the neighborhood of a = 4, slight perturbations in the 
utility values can alter the arbitrated solution drastically—from (a, 0) or 
thereabouts to (0, a) or thereabouts. 

None of the schemes which have been considered seriously either here 
or in the literature exhibit such a trivial pathology; however, they uni- 
versally possess a more subtle instability. We may illustrate it in a 


_ Tepresentative special case. Consider the two bargaining regions shown 


in Fig. 17. In this example we shall not suppose that utilities are inter- 
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For the two-person cooperative games, it w uggeste at reasonable 
axioms might specify that the point lie in the negotiation set; require that 
the point be independent of the utility units used and of the labeling 
of the players; require stability in the sense that slight perturbations of the 
payoff entries not drastically affect the arbitrated value; and reflect the 
threat capabilities of the two players. The rest of the chapter was 
devoted to more precise statements of specific axiom schemes. 

The first of these was restricted to the class of cooperative games where 
only exchanges of goods occur: bargaining problems. Nash assumes that 
solutions to this problem should satisfy: invariance with respect to utility 
transformations, Pareto optimality, independence of irrelevant alterna- 
tives, and symmetry. From these it follows that there is a unique solu- 
tion which may be obtained as follows: translate the utility scales so that 
the status quo point is at the origin, find the point for which the product 
of the two coordinates is a maximum, and then invert the utility trans- 
formations. A number of criticisms of the axioms were presented and 
discussed. 


Several approaches to unrestricted cooperative games were described. 
The first, which we called the Shapley procedure because it is a slight 
extension of the Shapley value from n-person theory, takes the maximin 
values as the basis of bargaining. An asymmetric game which is sym- 
metrically resolved by the procedure was offered as a criticism. 

Second, we described Nash’s extension of his solution to the bargaining 
problem. ‘The easiest way to present it is as a reduction of the coopera- 
tive game to a non-cooperative one; however, this is completely ad hoc. 
Alternatively, an axiomatic method for obtaining the same result was 
sketched. In essence, two axioms were added to those for the bargaining 
problem, but their rationalization did not seem adequate. Although we 
were Critical of both of Nash’s separate approaches to this problem, we felt 
that each helped to support the other, and that collectively they have 
much merit. The Nash solution was independently arrived at by 
Raiffa who used a different type of rationalization. 

The next procedure, one of several suggested by Raiffa, rests upon an 
assumed intercomparability of utility. The cooperative game is trans- 
formed into a zero-sum game of relative advantages, its value is obtained, 
and the corresponding contour of relative advantage is found in the 
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i utility transformations. As an illustration 
original Sel Bc geaics ‘ be transformed so that the most 
Be scconce has utility 1 and the least a i. ; ge: 
suggests an alternative transformation in whic the utility interva from a 
player’s maximin strategy (based upon his payoffs) to his ‘minimax 
strategy (based upon his opponent’s payoffs), under the condition that 
his opponent uses his maximin strategy, is taken as the unit. Both pro- 
cedures satisfy all of Nash’s axioms save the independence of irrelevant 
alternatives. Note that the solutions are independent of separate changes 
in utility units for the players. 

A strong technical similarity was established between these last pro- 
cedures, which rest upon contours of relative advantage, and Nash’s exten- 
sion of his bargaining solution, 
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THEORIES OF »-PERSON GAMES 
IN NORMAL FORM 


7.1 INTRODUCTION 


The theory of games would be a very incomplete edifice, both estheti- 
cally and practically, if it were restricted to the two-person case. It is 
not. In this and the following five chapters we examine the general 
theory which is, in the main, very different from the two-person theory 
and, we are forced to admit, less satisfactory. 

Intuitively, it is reasonable to suppose that the two most significant 
notions of the two-person theory—mixed strategies and equilibrium points 
—can be extended to games with more than two players, and this exten- 
sion we shall discuss in the present chapter. Were these generalizations 
and the resulting theorems the totality of n-person theory, we should have 
presented it in a unified manner for alln 2 2. However, it has long been 
recognized in sociology, and in practical affairs, that between two-person 
situations and those involving three or more persons there is a qualitative 
difference which is not as simple as the difference between 2 and 3. 
Georg Simmel writes, ‘“The essential point is that within a dyad, there can 
be no majority which could outvote the individual. This majority, how- 
€ver, is made possible by the mere addition of a third member.” [1950, 
P. 137,] Again, “The typical difference in sociological constellation, 
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to this phenomenon: the relation of the two-person gam: ear prt 
graming and to statistics has attract 

the known importance of the latter two subjects; mathematicians have 
been intrigued by the two-person theory because it draws on more 


advanced mathematics than does n-person theory; and many workers have 
felt dissatisfied with the present formalization of n-person theory and 
rather than meet the conceptual challenge they have, for the most part, 
withdrawn to other issues. 

Nonetheless, it is the n-person theory which must be of greater interest 
in sociology and economics. It is here, more than in two-person theory, 
that game theory as a part of social science, though not as a part of 
mathematics, will stand or fall. 


7.2 MIXED STRATEGIES AND THE NORMAL FORM 


Back in section 3.7 we arrived at the normal form of an n-person game 
in pure strategies; it will be recalled that it consists of: 


(i) The set J, of n players, 
(ii) The n strategy sets S1, So, °° * , Sn, 


and 


(iii) The n real-valued payoff functions M,, Mo, --+-+, Mn, where 
Mj(s1, 52, ° * * , Sn) is the utility payoff to player 7 when player 1 
uses strategy 51, 2 uses strategy so,.° *. .. and player fl USES $,,. 


In addition it was assumed that each of the players knows the entire 
structure of the game in normal form and that each is governed in his 
behavior by an inflexible desire to maximize expected utility. Beginning 
with this structure and specializing it for n = 2, the discussion of two-per- 
Son games forced us to introduce the concept of a mixed strategy, i.e., of a 
Probability distribution over the set of pure strategies. It seems reason- 
able that if this concept was needed there it will also be needed for n > 2. 
The 8eneralization is practically obvious, but for the sake of completeness 
We shall present it here. 


Pir 5, is a typical pure strategy in S;, then a mixed strategy o; for player i assigns 
@ probability to each si. If we denote this probability by o;(s;), then we must 
a @3(5;) > 0 and the sum of all these quantities over all s;in S; must be 1. Let 


Suppose that player ¢ chooses the mixed strategy 0; and that each of the players 
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7.3 CONSTANT-SUM AND ZERO-SUM GAMES 


In the theory of two-person games a strictly competitive game was 
called zero-sum because it is always possible to choose the zeros and units 
of the player’s utility functions in such a manner that the sum of the two 
utility functions for any strategy choices is zero. This only reflected the 


fact that the interests of the players were strictly opposing; these choices 
of units did not make an 


nd origin for each of the players such that the sum of the 
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‘snot zero-sum. Most economic processes, even if they are games, cannot 
be treated as zero-sum games. 


*7.4 BEHAVIORAL STRATEGIES AND PERF] 


It has surely occurred to the reader that, although these notions of 
strategies, both pure’'and mixed, may be fine tricks for the mathematical 
development of game theory, people almost never pick a strategy on such 
a grand scale. Even for most parlor games, the domain of strategies is 
just too large ever to have been completely given; in all the years that 
chess has been played and analyzed, only a small fraction of partial 
strategies has ever been discussed and listed. Thus, one might wonder 
about a theory of games with a more modest view of the strategy notion. 
One of a somewhat special and limited nature has been examined and the 
results are interesting, for,in a certain important class of games they justify 
a theory based on mixed strategies. 

Instead of giving a mixed strategy to the umpire, a player might specify 
for each of his information sets a probability distribution over the alterna- 
tives of the set. Such a class of distributions—one for each information 
set—is known as a behavioral strategy for the player. Now, although it is 
still a monumental task to list behavioral strategies for most games, it 
may be felt that in effect a player has such a distribution in his mind when 
he makes decisions during a play of the game, and that by making him 
play it many times (after learning has occurred) and observing his choices 
we could get experimental estimates of these distributions. 

A neat way of viewing the difference between mixed and behavioral 
strategies has been suggested to us by Harold Kuhn. One can think of 
each pure strategy as a book of instructions where each page refers to just 
One information set and states exactly what should be done at that infor- 
mation set. The strategy set is a library of such books. A mixed strategy 
chooses one book out of the library by means of a chance device having the 
Probability distribution of the mixed strategy. A behavioral strategy, on 
the other hand, is a book of a different sort. Although each page still 
refers to a single information set, it states a probability distribution over 
the alternatives at that set, not a specific choice. 

It will help in understanding the several points to be made in this sec- 

Hon to have a specific example in mind. Consider the game tree shown 
in Fig. 1, Player 1 has four pure strategies (a, c) (a, d), (b, c), and (6, d) 
Which we may denote by a, a2, a3, and a4, respectively. As we proceed 
; Wwe will illustrate the several concepts in terms of this game. 
It is reasonably clear—and it can be shown—that each mixed strategy 
or a player induces a unique behavioral strategy for him, namely, the 
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Vyas, V4a,4) result in the same distribution over end points; but replac- 
ing a; by the behaviorally equivalent mixed strategy 0; ’= (141, 0a2, 
a3, }ga4) results in a different distribution. The principal feature of 
the mixed strategy o;’ is that it produces a correlation between the choices 
in the first and the second information sets; this is not possible with 
behavioral strategies. The possibility of correlating the choices on differ- 
ent information sets is related to the signaling phenomena discussed below. 
With the payoffs to player 1 indicated in Fig. 1, it is clear that pure 
strategies a2 and a; are dominated by a, and ay respectively. The mixed 
strategy is maximin for 1 guaranteeing him a security level of 14. But its 
induced behavior strategy results in a constant payoff to 1 of 0 units. 
Intuitively it is clear what we are trying to say. Now the only problem 
is to give a suitable general definition. There are several ways this can 
be done, but possibly the simplest is to introduce a concept due to Thomp- 
son [1953 a] which we shall also need in the next section. He defines an 
information set U of player i to be a signaling information set? if we can find 
some other information set, say V, later on in the game tree which also 
belongs to player i and a branch numbered r leaving the set U such that: 


i, There is at least one move in V which can be reached by a path 
starting with the rth branch of a move in U. 

ii. There is at least one move in V which cannot be reached by any 
path starting with the rth branch of a move in U. 


We see that, if U is a signaling information set for player 7, when he is 
at V he finds it impossible to know whether or not he chose the rth alterna- 
tiveat U. For example, the first information set of player 1 in Fig. 1, ice., 
the first move, is a signaling information set since one of the moves in his 
second information set can be reached by a path beginning with the left 
branch of the first move and the other cannot. 

A game is said to have perfect recall if there are no signaling information 


* The term “signaling”’ used here arises, presumably, from a consideration of bridge, 
Which has to be treated as a two-person game with a pair of partners constituting a 
Single Player. The choice(s) of one partner often serves to signal considerable infor- 
Mation to the other (and the term “signal” is part of the vocabulary of bridge), but 
rarely, if ever, is the ambiguity totally removed by such signals. 
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«7.5 COMPOSITE STRATEGIES 


Given that in games with perfect recall the analysis can be at the level 
either of behavioral or mixed strategies without affecting the expectations 
of the players, the question arises whether anything more can be said for 
games without perfect recall. Thompson [1953 a] attacked this problem, 
and he has given an intuitively satisfactory solution. 

Let A; denote the set of all signaling information sets for player 2 (see 

§ section for the definition of a signaling information set). 
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Se pate strategy is defined in the natural way, and Thompson 
[1953 a| has shown that the payoffs associated with an n-tuple of mixed 
strategies are identical to the payoffs of the induced composite strategies. 
Thus “‘ - * * any payoff which players can obtain by means of mixtures 
of pure strategies, they can also obtain by means of composite strategies.” 
[1953 a, p. 275.] 

This result is of considerable importance in the examination of specific 
games without perfect recall. Thompson remarks, “This theorem, 
together with the fact that the normalized form of the game obscures 
signaling strategies, explains one reason why the normalized form of the 
game is not always the best form in which to solve actual games.” [1953 a, 
p. 275.] 

In another paper, which we shall not go into here, Thompson [1953 5] 
uses the notion of signaling strategies to examine a simplified form of 
bridge. 


7.6 COMMUNICATION BOUNDARY CONDITIONS 


With the last section we have completed the discussion we shall give of 
games in extensive form. From now on we shall deal only with games in 


normal form and with an abstraction derived from them, the characteristic 
function. 


Under the assumption that individuals are interested in maximizing 
their expected utility, the aim of present-day game theory is to construct 
4 notion of equilibrium social behavior and to investigate the properties 
of such a concept. The word “equilibrium” indicates that the theory is 
in some sense a static one, and for, at least, practical purposes this must 
be considered a serious limitation of the present theory. Once one has 
fixed upon the normal form, then there can be no consideration of the 

ynamic process whereby the equilibrium states are achieved nor can any 
“arning be admitted at that level. The individual preference patterns, 
48 given by the payoff functions, are assumed to be invariant both in time 
and with the unfolding experience of participating in the extensive form 
ot the Conflict of interest, Furthermore, the strategy spaces are assumed 
a which is certainly false for the many economic processes which 
ject to modification by technological developments and research, 
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It appears that to include it in a generalization of game theory will be an 
exciting major theoretical step. Lacking such a gener | 

tacks have been taken, each of which is unhappily special and 
It is somewhat consoling to observe, however, that we can find an analo- 
gous situation in the physical a ee t continuous processes, but more 
of that later. We shall cite three different approaches to the problem of 
restricted collusion. 

First, there is the one extreme in which any collusion logically possible 
is allowed to occur. This is characteristic of the von Neumann and 
Morgenstern theory of solutions, which will be discussed in detail in 
Chapter 9. In their theory such freedom to cooperate leads to vast 
numbers of “solutions” with no criteria to select among them. They are 
forced, as we shall see, to the ad hoc assumption that in practice there exist 
social standards which determine the solution which actually occurs, but 
no attempt is made to exhibit a theory of these standards. 

Second, there is the other extreme which prohibits any collusion at all. 
Such a condition may not be nearly so limiting as it first seems. Certain 
authors, notably Nash [1951], hold that non-cooperative games are theo- 
retically basic and that cooperative games can and should be subsumed 
under that theory by making communication and bargaining formal 
moves in a non-cooperative extensive game. The resulting normalized 
game would simply enlarge the domain of the various strategies, 
and the payoff functions could be extended to these larger sets in 
the natural manner. Were it possible to give an explicit and intuitively 
acceptable way of enlarging an extensive game so as to include communi- 
cation the argument would be very convincing. Specific criticism is hard 
to make since this view has never been fully elaborated, but McKinsey 
[1952 a, p, 359] has pointed out, “It is extremely difficult in practice to 
introduce into the cooperative games the moves corresponding to negotia- 
tions in a way which will reflect all the infinite variety permissible in the 
“operative game, and to do this without giving one player an artificial 
advantage (because of his having the first chance to make an offer, let us 
Say),”? 

In a way, this conceptual solution to the formalization of preplay com- 
munication simply buries some of the most interesting aspects of the prob- 
€m. One is interested in understanding the forces which lead groups to 
“operate, in the cohesiveness of coalitions over repeated plays of the 
Same, and so on, and we do not want to prejudge these problems by 
| See 6 thers into the extensive form in some special manner. 

Bu whether or not we accept the belief that all games can or should be 
» 1 terms of non-cooperative games, one part of the theory certainly 
devoted to such games, and presumably it should be a “natural”? 
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tions will be termed a coalition structure. For example, the partition 
({1}, {2}, °°» {2}) represents the case where there are No non-trivial 
coalitions, i.e., there is no cooperation. The partition ({1, 2,--., 
n — 1}, {n}) represents the case where the first n — 1 players have joined 
forces in a Coalition against player n. 

Let us now suppose that at some stage of the bargaining in a given con- 
flict of interest the players are arranged in coalitions according to a par- 
tition, which we shall denote by r. Since each player is a ruthless ration- 
alist, we must assume that each is considering various potential changes 
of alliance in an attempt to better his own outcome. These changes may 
ee es prior to the play of the game, or, if it is 
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Mathematically, a rule y can be identified with the table composed of 
the lists ¥(r) for each r. That is, y can be thought of as a function 
which maps each 7 into a class of coalitions. 

It may be worth appending that this third notion includes the other 
two (i.e., no changes permitted and all possible changes permitted) as 
special cases: Define the function y’ to have the property that y’(r) is the 
class of all coalitions in a structure 7 for each r; then no changes are per- 
mitted. Define the function y”’ to consist of all subsets of J, for all values 
of 7, then all logically possible changes are permitted. 

There seem to be at least two major objections to postulating such a 
function y: first, we have no theory to justify it; and, second, we have no 
idea how it is to be determined in particular cases. Without attempting to 
dispute these points, it may be worth observing that such functions seem 
to play a role not unlike boundary conditions in some of the continuous 
flow theories of physics, whereas the given coalition structure r can be 
thought of as an initial condition. The boundary conditions of, say, a 
heat or air flow problem are not given by the physics of the flow process 
but rather are supposed to represent certain salient facts about the par- 
heular physical configuration under study. The form of the boundary 
Conditions is in part determined by the equations representing the flow 
Process, but their detailed selection is arbitrarily given by the scientist 
choiee nS, the analysis, who calls upon one’s intuition to accept his 
phiysios oes a 1s aie sit ease ie oath an art which in 
result of pase anit as gradually become highly sophisticated as the 

ailures and successes in relating theory to data. 
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7.7 CLASSIFICATION OF CONTEXTS FOR /-PERSON GAMES 

Once one leaves two-person zero-sum games, there are serious questions 
of extra-game-theoretic assumptions. ne—the limitations on collusion 
—-was discussed in the preceding section, but it is by no means the only 
one which has been considered in the literature. It seems appropriate 
to specify these here and then to classify the several theories (and sugges- 
tions for theories) in terms of these. This will permit the reader to find 
his way around the complexities of n-person theory a little more simply. 

Most of the past work in n-person theory has supposed that, in addition 


to receiving the payoffs prescribed by the rules of the game, the players 
are permitted to make additional transfers—side 
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divisible and desirable commodity d 
as it is presently meant in n-person theory, for it does not imply conserva- 
tion of utility under reapportionments. The question of side payments 
will be dealt with more fully in section 8.1. 

If the assumption of unrestricted transferability is dropped, then it is 
clear that there is a whole complex of possible cases ranging from perfect 
transferability to none at all. It seems hopeless to try to develop a theory 
covering all cases, and it is much too tedious to examine many of the 
intermediate assumptions, so, as is often done in mathematics, only the 
polar extremes are studied. Thus, a theory will assume either that per- 
fect transferability is possible using some infinitely divisible and desirable 
commodity which is conserved, or that no transferability is possible at all. 
The latter assumption is not lacking in interest, for in many situations 
the mores and legal codes prohibit bribing. 

Still another question of context is whether, when preplay communica- 
tion can occur, the players are able to employ correlated (ie., joint) 
strategies, or whether they must only agree upon individual strategies 
which, although possibly coordinated choices, are not correlated, i.e., 
independent in the statistical sense. To correlate strategies when a game 
is not temporally repeated necessitates preplay communication. 

Only certain combinations of these contexts have received much atten- 
tion in the literature. In a way, the cases of neglect are curious, for it 
is among them that one finds some of the natural generalizations of the 
central notions of two-person theory. The following table presents all 
these possible combinations of contexts, lists the sections where the corre- 
sponding theory is discussed, and states the name of the theory (if one 
exists). Under the column Preplay Communication, the word “‘partial”’ 
Includes, as a special case, no limitations on communication whereas “all” 
refers only to total freedom to communicate. 

Possibly the most surprising omission in the literature is the case of no 
side Payments and partial preplay communication. In section 7.9 we 

'scuss some suggestions toward such a theory, under the assumption 
that communication enables the players of a coalition to choose joint 
Strategies but does not allow intercoalition communication. In particu- 

at; One coalition is not allowed to threaten another. Were this per- 
mitted, then, in all likelihood, interpersonal comparisons of utility would 
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dvantage tO change to a different strateg} so long as he b at 

: layers will not change. ‘Thus, if we look at player 7 it must be 
Pate cannot expect to benefit by employing strategy 1; l of 5; 
Now, it may well be that, if player z eo ild communicate to player j ane 
they agreed upon some joint Ebener al strategies, th co ith benef 
but we have assumed that no collusion is permitted Thus, player 2 can 
only consider changes which are under his direct control, i anges 1n 


his own strategy choice, and it is argued that, if none of these changes 
benefit him, he will not change. If (51, 52, - + * , sn) should be chosen 
so that what is true for player 7 is also true for all the other players, then 
there are no resulting forces to change the given system of strategies, and 
hence the n-tuple of strategies is in equilibrium. To say that player z 
does not benefit by changing his strategy choice simply means that his 
payoff does not increase by any other choice, i.e., 


Mi(s1, 52, Py Say Sa) 2 Misi, 50, daeet Ls! Dini SA) 


Thus, one is led to the following definition: An n-tuple of pure strategies 
(51, 5, °° * » Sn) is an egualibrium point in pure strategies if the above 
inequality holds for every z and for every choice of r; in the set S; of pure 
strategies for player 7. It is not difficult to show that when n = 2 this 
definition is the same as that given earlier in the two-person theory (sec- 
tions 4.8 and 5.7). 

As in the two-person case, there is no assurance in general that an 
equilibrium point exists in pure strategies. It is known that a sufficient 
condition for games to have equilibrium points in pure strategies is that 
they have perfect information [Kuhn, 1953 4], but this is not a necessary 
condition. Dalkey [1953] and Otter and Dunne [1953] have given neces- 
sary and sufficient conditions, but the statement of these results is too 
complicated to warrant inclusion in this book. Birch [1955] has extended 
these results by giving a sufficient condition for a game to have a mixed 
strategy equilibrium point (see below) in which certain players use pure 
strategies, 

Fortunately, the parallel with the two-person case extends to the point 
Where mixed strategies again suffice to establish existence. The above 
definition of an equilibrium point in pure strategies can obviously be 
"eworded to give a definition of an equilibrium point in mixed strategies 

Y the simple formal substitution of “mixed strategies o,” for “pure 
srategies si.’ Nash’s principal theorem [1951] shows that over the 

main of mixed strategies every finite game has at least one equilibrium 

ia result shows that Nash’s definition has one extremely 

is he © property of an equilibrium notion: existence. (The proof given 
pendix 2 generalizes to any n.) 
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ties raises much more serious questions as to its merits than could be raised 
against the minimax concept. These points have already been made 
when we talked about non-zero-sum two-person games, but we shall 
repeat them here. First, if each player were to confine his strategy choice 
to those which are a part of some equilibrium n-tuple, the resulting prob- 
lem faced by each player is again a game. It is a contraction of the old 
game, but it may be just as difficult to analyze conceptually as the original 
game. Indeed, in some sense it may be more difficult for a player to 
analyze it because it crystallizes the difficulties involved. Thus, the 

equilibrium notion does not serve in general as a guide to action. 
Second, one may look upon the notion as possibly descriptive in nature. 
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The complications of non-equivalence and non-interchangeability of 
equilibrium points lead one to ask whether there is not some plausible 
condition which may be added to isolate a single equilibrium point as 


more acceptable than the others. For games with perfect information 
(see section 3.2) Gale [1953] has presented the following idea. He calls 
two strategies equivalent if they yield the same payoff to a player against all 
combinations of strategies for the other players, and he defines one strategy 
to dominate another if the former never results in a smaller payoff than the 
latter and yet they are not equivalent. Within a given game, two opera- 
tions are defined: first, simultaneous averaging of all equivalent strategies 
for each of the players; and, second, simultaneous deletion of all dominated 
strategies for each of the players. Gale’s theorem then states that, if we 
begin with a game having perfect information and if these operations are 
applied recursively in the order given, after a finite number of applications 
a game results which has but one strategy for each player. These unique 
strategies are in fact mixed strategies of the original game, and they con- 
stitute an equilibrium point in that game; this Gale calls the “solution.” 
Against this idea, at least as a plausible descriptive notion, one can 
argue that there is no compelling reason why a player should put equal 
Probability weights over all equivalent strategies. Why not put all the 
weight on the strategy in the equivalence set which yields the best average 
return to his opponents? Or why not a host of other possibilities? Also, 
Can one expect players to go through the averaging and deleting process as 
described? What about other alternatives? For example, in a game with 
three Players, 1 might carry out the process conceptually with only player 
2, holding 3°s strategy space fixed. When that reduction is completed, 
Suppose he engages in the mental process of averaging and deleting with 
Player 3, then back to 2, and then back to 3, etc. It is possible that this 
‘oo might lead to an equilibrium point, and, indeed, it might be preferred 
by the Players to the one given by Gale’s procedure. In other words, we 
object to the arbitrary and implausible nature of the process assumed. 
Onetheless, it does have the distinct merit of being completely symmetric 


Oras the players, and it has the property that, ifn — 1 players are com- 
ey to this suggestion, the nth player might as well go along. It will 
?alled that in connection with the temporal repetition of the prisoner’s 


dj ‘ 
Vemma (section 5.5) we described an analogous procedure. 
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and that cooperation between the players allowed them to correlate their 
strategies so as to randomize between (2, 1) and (1, 2). It will also be 
remembered that, if we plot the set of feasible payoff pairs, we argued 
that the players should restrict themselves to the northeast corner of the 
area—to the Pareto optimal set. Furthermore, it was argued that neither 
player should accept less in the cooperative context than he can guarantee 
for himself in the non-cooperative version of the game by using his maxi- 
min strategy. The set of Pareto optimal points which exceed the maxi- 
min values for each player was called the negotiation set, and this was 
ee. aan of the cooperative game by von Neumann and 
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we have assumed correlated strategies are poss 

n-tuple does not allow for that. To symbolize the correlation 


within a coalition, let r = (71, T2, -- +, 7;) and let o(7,) denote ; 
typical correlated mixed strategy jointly chosen by the players in the 
coalition 7; Then, the aim of the theory will be to characterize those 
pairs [(o(T1), oS) ae o(T;)), (T;, T2, + + + , T;)] which are in 


equilibrium when cooperation is allowed. 

The next question to consider is: what cooperation? Our assumption 
will be that there are limitations upon the contemplated changes of 
coalitions from a given coalition structure 7. Or to put it in another 
fashion, given a coalition structure, not all the possible subsets of players 

| are allowed to have preplay communication and to adopt joint mixed 
strategies. ‘That is, we shall suppose that there are given sociological 
restrictions which can be summarized as a function y of the type discussed 
in section 7.6. 

Consider the following argument for the equilibrium of a pair [(¢(7}), 

-+ ,0(T;)), tT]. If S is an element of V(r), i.e., S is a possible coalition 
change when the players are arranged according to the coalition structure 
t, then § may be expected to form and to disrupt the given pair if each of 
the players in the set S can be made to gain by the change. Thus, a 
hecessary condition for the pair to be in equilibrium is that, for each 
Sin ¥(r) and for each selection of a cooperative strategy a(S), there shall 
) be at least one member of S who does not profit by the change. Since 
we have assumed that there are no side payments, this simply means that 
the payoff as given by the rules of the game is no greater than it was in 
the equilibrium state. 

Let us try to formalize this: The pair [(¢(7T,), - - - , o(T;)), 7] is in 
equilibrium if, for each S in ¥(r) and for each correlated mixed strategy 
9(S), there is at least one player 7 in § such that 


Mj[o(T1),o(T2), + + + ,o(T:)] > Mylo(S), + + +). 


The question is how to fill in the dots on the right. If S' were to form, 7 
Is disrupted, and so, in general, the original strategies ¢(7;) become mean- 
ingless since some of the coalitions T; no longer exist. It is surely not 
*easonable to suppose that the remaining groups of players will carry on 

aa change had not occurred, and, indeed, one may expect the change 
i. — the formation of S to cause those players not in S to reappraise 
se etely their collusive arrangements. SoS will not know the reaction 

contemplated actions, but if the players in S are conservative they 
€xpect the worst—they will expect all the remaining players, —S, to 
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then we claim that the resulting pair 1s certainly in eq 
mary, then, the definition reads: A pair mes), - °° ,0 oe 
(for a cooperative game without side payments but wv correlated 
strategies) if, for each S$ in ¥(r) and for each mixed strategy o(S), there js 
at least one player j in § vulnerable to an attack by —S, ie., —J has a 
strategy ¢(—S), say, such that 


Mjlo(T1), - - - , o(T;)] > Mjlo(S), o(—S)}. 


When we reach Chapter 10, we will see that this notion is conceptually 
similar to y-stability theory for cooperative games with side payments 
In addition, it is a generalization of the negotiation set, as we now show. 


P Since we are considering cooperation, r = ({1, 2}) 
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has been done on this concept 


can be offered as to its properti 

For cooperative games with 
strategies are prohibited, a simila 
course, is that among the admissi 
uncorrelated) changes of strategies w 
least one member of the coalition. 


an uncorrelated mixed strategy n-tuple, 7 a coal 


the class of coalitions capable of disrupting -stable 1 


and only if there exists a coalition S in ¥(r) whose members can so coordi- 
nate their choices of mixed strategies (without correlating them) that each 
improves by the defection when the remaining players hold fixed to their 
original strategy choices. Such a pair is y-stable if this condition does 
not hold, i.e., if for every coordinated choice of mixed strategies by $ 
there is at least one member of S who fails to improve by the defection. 
It is easily seen that, ifr = ({1}, {2}, , {n}) and if y admits no 
changes from 7 (which describes the non-cooperative case), then a neces- 
sary and sufficient condition that [6, 7] be y-stable is that 6 be a Nash 
equilibrium point; in other words, the definition is a generalization of 
Nash’s notion. One might wish to generalize this notion to allow the 
members of a disrupting coalition S to correlate their strategies. 
Farquharson [1955] and Shubik [1957] have given this definition for 
the special cases where r = ({1}, {2}, - - - , {n}), and where y({1}, {2}, 
‘++, {n}) consists of all coalitions with & or fewer players. In other 
words, they consider simultaneous changes of & or fewer players, whereas 
Nash considered only the case where k = 1. Shubik used the term 4-sta- 
bility for this notion (the same term will be used in section 10.2 for a some- 


what different but related idea), and Farquharson speaks of equilibrium 
Points of order k. 


7.10 suMMARY 


For the most part, this chapter has been concerned with the extension 
of some concepts from two-person theory to games with more players and 
With some new distinctions which seem necessary. Among the ideas 
*xtended were: mixed strategy, zero-sum game, and equilibrium point. 
aia distinctions were largely in the realm of extra-game-theoretic 
alia, S including side payments, coalition structures, limitations on 
a (communication boundary conditions), correlated strategies, 

: able utilities, and interpersonal comparisons of utility. 
ton an two-person theory, a mixed strategy is simply a probability distribu- 
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only when a player remembers (in the sense of information sets) his previ- 
ous choices; he need not, however, recall the choices made by the othe 


players. Thompson has extended this result to general games by showing 
over which information sets mixed strategies must be used and those for 
which behavioral strategies suffice to achieve the same expectation as 
with mixed strategies. 
A second central notion from two-person theory which was generalized 
Is an equilibrium point: a collection of strategies, one for each of fhe n players, 
| ie ae . . increase his payoff by changing his strategy 
Be bos ot 1 ers hold theirs fixed. Nash has shown that every 
ie One Tiixed strategy equilibrium point, but not neces- 
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cooperation; and, at the other extreme, 
which is the assumption made by 
their theory of solutions (chapter 9 

In addition to these communication | 
distinctions are needed in the cooperativ 
coalition use correlated (joint) strategies or 
among the players be effected or not?. If so, does utility possess that 
special feature, known as unrestricted transferability, which makes it behave 
like money, i.e., is it infinitely divisible and is it conserved when 
ferred? Depending upon which choice is made in each case 


theories arise; these were charted in a table in section 7 


Of the eight cases, only three have been extensively studied in the 
literature. Of the omissions, the cooperative case in which side payments 
are prohibited seemed most surprising, for that appears to be a natural 
generalization of two-person theory. In the final section, two definitions 
(depending upon whether correlated strategies are permitted or not) for 
the no side payment case were proposed but not explored. These are 
both based upon an assumed boundary condition y. The general idea 
is that a pair consisting of strategies (correlated or not, as the case may be) 
and a coalition structure will be in ‘“‘equilibrium” provided that at least 
one person in each of the admissible coalition changes (as given by yw) is 
placed in danger of suffering a loss by participating in the change. It 
was shown that the uncorrelated case is a generalization of Nash’s equilib- 
rium point theory and that the correlated case is a generalization of the 
Concept of a negotiation set. 
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When we speak elliptically - Sea 

exist a set of utility scales for the players and an infinitely divisible he 


geneous commodity such that the changes in individual utilities whicl 
5 ; Ret : : ilitie 
beat when this commodity 18 transferred conserve the total utility sur. 
In terms of these utility scales it is meaningful to talk about the gis 


utility accruing to a coalition and about a partition of this utility sum 
among the participants of the coalition. Of course if utilities are based 
on other scales (i.e., different origins and units), the utility changes which 
will occur with transfers of physical goods need not necessarily conserve 
the utility sum. 

The assumption of a transferable utility is reasonable if monetary sidé 
payments are allowable and if each player’s utility for money is approxi- 
mately linear in the range of potential payoffs of the game. Although 
these conditions are sufficient they are not necessary. Money is not the 
only commodity which can serve the purpose of transfering utility. For 
example, labor may play a similar role in some contexts (e.g., side pay- 
ments in the husband-wife coalition may consist of the transfer of house- 
hold chores). Also, if monetary side payments are allowable, it is mathe- 
matically possible— but not very probable—that money transfers may 
be made which conserve utility even though each of the players’ utility 
for money is non-linear. 

To each outcome of a game—an outcome which might involve mone- 
tary as well as non-monetary consequences—each player can associate an 
equivalent pure monetary return in the sense that he is indifferent between 
this outcome and this amount of money. Monetary equivalences for out- 
Comes do not constitute a suitable utility indicator unless the players’ 
utility functions for money are linear in money. In this case, the set of 
utility scales which agree with the scales of monetary equivalences are 
specially convenient, for in terms of these scales we can speak elliptically 
about transferring utility with conservation. We emphasize again that 
this in no way implies an interpersonal comparison of utility. 

a suggest that, if the above remarks are kept in mind, the reader will 
aN astray in the sequel by interpreting all payoffs in terms of mone- 
ee alences and assuming each player’s utility for money is linear in 
< y. One does not have to assume that money has the same “‘worth”’ 
Whatever that means—for each player. 
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Still, one might object that, if each time the coalition did its best one of its 
players did no better, or even worse, than he could have alone, it might 
indeed be difficult to persuade him to remain in the coalition. As long 
as the payoff is in some sort of transferable commodity which results in the 
transferability of utility, this is no problem. ‘The other members of the 
co. may simply extend him side payments in order to keep him in 
t aS The extent of the side payment is a difficult problem of 
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ns involved in determining v are generally very complicated 
however, does not weaken the power of th 


functions! mo | 
The function v is not without certain restrictions; i DE SHOWN 1 
in the zero-sum Case it satisfies 


(i) vn) = 0, 
(ii) (5) = —v(—S), for all subsets $ of Ip, 
(iii) v(¢) = 0, where ¢ is the empty set, 


and 
(iv) If R and S are any two disjoint subsets of players, 
v(RUS) 2 v(R) + o(S).2 


(Note that, given condition ii, conditions i and iii are equivalent.) The 
first two conditions simply reflect the zero-sum character of the game. 
The third is a formal statement of the “‘obvious fact” that the subset 
involving no players neither loses nor wins anything. The important 
condition is iv, which, on reflection, is extremely plausible. It says that 
the whole does not obtain less than the sum of its parts, or, put another 
way, a coalition composed of the disjoint sets R and S can do anything R 
and § can do as separate coalitions, and possibly more. 

The function v has been named the characteristic function of the zero-sum 
game from which it was derived. 

It is interesting and important that any real-valued set function v satis- 
fying conditions i through iv is the characteristic function of a zero-sum 
game. That is, given any such y, it is possible to construct a game in 
normal form which has as its characteristic function the given function v. 

The extension of the concept of the characteristic function v to non-zero- 
sum games is completely straightforward. For any coalition S we let 
»(S) be the maximin value (optimal security level) of the coalition S in the 
non-cooperative two-person non-zero-sum game S versus —S (see 
Chapter 5). Of course, the function v no longer has two of the properties 
of the zero-sum case, v(I,) = 0 and o(S) = —2(—S), but it does satisfy 


| the other two, namely: 
) 2(4) = 0, 
li) IfR and S$ are any two disjoint subsets of players, 
v(RUS) 2 v(R) + 2(S). 
PPro 
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If, in addition, the game is constant-sum (not ne 
not excluding that case) then: 


(iii) o(S) = v(Zn) —v(—S), for all subsets S of J. 


Of course, if we assume the game is zero-sum, then o(/ 0, and (i 
becomes the old condition (ii). 

When we use the term characteristic function we shall mean any real. 
valued set function satisfying (i) and (ii), for it is again true that for each 
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that the given situation is a game in normal for 
formity to present usage, we shall refer to a set of play nc . 
istic function defined over the subsets of players as a game, and whenevel 


it is necessary to avoid ambiguity we shall add “‘in characteristic function 
form.” 

This last discussion suggests that one interesting and comparatively 
easy way to study people’s responses to a characteristic function experi- 
mentally is to present the game in terms of the payoffs to coalitions, i.e., 
in the form of the characteristic function. One can then observe the steps 
of coalition formation, the resulting coalitions, and how the spoils accruing 
to a coalition are divided among its members. Exactly such an experi- 
ment has been performed; it is discussed in section 12.3. 

Our next step is to divide characteristic functions into two classes. It 
is conceivable that there are games in which no coalition of players is 
more effective than the several players of the coalition operating alone, in 
other words, that for every disjoint R and 5S, 


v(RUS) = o(R) + v(S). 


Such games are called inessential; any game which is not inessential is 
called essential. It is not difficult to show that a game is inessential if and 
only if the total payment to the set of all players is exactly the same as the 
sum of payments to all the individual players, i.e., 


vIn) = ) o({i}). 
t=1 
Since nothin 


4 § is gained by forming coalitions in inessential games, it is 


an that we cannot expect any theory of coalition formation in that case, 
ne so we shall be concerned only with essential games from now on. 
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Suppose that one charac : 
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v($) = o'(S), for all subsets S of J,; 


then the two characteristic functions differ only in the unit whereby we 
measure the utility. One example would be to transform a characteristic 
function originally in dollars to one incents. It is clear that such a change 
of unit cannot possibly affect the strategic character of the game to 
rational players. 

Next, suppose that we have a game with characteristic function v and 
suppose that, in one way or another, each player 7 is paid (or is caused to 


pay, depending upon the sign) an amount q; prior to the play of the game. 
Certainly these payments do 
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It may not be obvious, at this point, that this definition of equivalence 1s 
a suitable one, and that no further grouping is needed; but the results we 
shall cite near the end of section 9.1 show that it is adequate, at least for 
the von Neumann-Morgenstern theory of solutions. * 

The relation of S-equivalence, defined over the set of all characteristic 
functions on n players, can be easily shown to satisfy the conditions of an 
equivalence relation; that is, one can show: 


(i) It is reflexive: for all v, v is S-equivalent to », 
(ii) It is symmetric: if v is S-equivalent to v’, then v’ is S-equivalent to 2, 


and 


(iii) It is transitive: if v is S-equivalent to v’ and if v’ is S-equivalent to v”, 
then v is S-equivalent to v’’. 


When a relation is reflexive, symmetric, and transitive it behaves in 
very much the same way as the notion of equality (or sameness), and in 
particular it divides the elements of the set over which it is defined into 
non-overlapping subsets such that the elements within any one set are all 
equivalent and any two elements from different subsets are not equivalent. 
These subsets defined by an equivalence relation are called equiva- 
lence classes. Within each set the characteristic functions entail the 


Same strategic considerations, and those in different sets require different 
Considerations. 


Assuming that S-equivalence is the appropriate grouping of character- 
Istic functions, we must next confront the task of selecting one representa- 
uve from each class in terms of which we shall construct our theories and 
‘xamples. Two suggestions have been offered, each of which has certain 
advantages, primarily in the simplicity of stating certain games and cer- 
‘ain definitions. The principle behind both of them is the same: it is 


Possible to require that part of the representative characteristic function 
4 . 

Experimental data will be cited in section 12.3 in which two S- 
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that the arguments leading to the definition of S-equivalence are not 


— ‘ et another interpretation is possible if the underlying knowledge postulate of 
Cory is weakened (see section 12.4). 
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One of the first advantages—px 
normalization is its emphasis on the 


tion theory and the concept of a probability measure over the subset of a 

finite set. Let us place side by side the conditi ‘or a characteristic 

function in 0, 1 normalization and for a probability measure: 

0, 1 Normalization Probability Measure 

j. v is a non-negative real-valued set p is a non-negative real-valued set 
function. function. 

ii. v(Z,) = 1. pU,) = 1. 

iii. v(d) = 0. p(>) = 0. 

iv. If R and S are disjoint subsets of Jn, If R and S are disjoint subsets of J, 
»(RUS) > o(R) + 0(S). p(RUS) = p(R) + p(S). 

v. o({i}) = 0. 

vi. if the game is constant-sum, v(S) It follows from (ii) and (iv) above that 
= 1 — v(—S), for all subsets S. p(S) = 1 — p(—S), for all subsets S. 


Although the resemblance between v and p is marked, there are differ- 
ences, the most important being the inequality in the former and the 
equality in the latter for (iv), and the lack of the fifth condition for p. We 
cannot have p({i}) = 0 for all i, for, if this were the case, by repeated 
application of (iv) we could conclude p(J,) = 0, which contradicts con- 
dition (ii). We shall return to this correspondence again when we try to 
characterize the principal problem of n-person game theory (section 8.6). 

This comparison suggests that the study of general games by means of 
characteristic functions could have been entitled the study of ‘‘finite 
superadditive measures.’? Conditions (i), (iii), and (iv) above suggest the 
name “‘superadditive measure,” and condition (ii) simply means that the 
measures are normalized, as in the theory of probability. However, con- 
dition (v), v({7}) = 0, is most unusual in measure theory. It is worth 
Pointing out, at least for the mathematician, that we may drop this condi- 
tion when we are studying theories invariant under S-equivalence, since 
under the transformation 
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strategy: 
function: 


od) = 9, — o({1}) = 0( (21) 


Thus, although the normal form is di 


Compare this analysis with a calculation « 
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istic function is perfectly symmetric, reflecting ; 
two players. It would be difficult to deny that a: 
tion based upon its characteristic function 
governing aspect of the normal form. 

This example is merely a special case of the more general observation 
that characteristic functions tend to be inadequate representations of non- 
constant-sum games. ‘There is considerable question whether the num- 
bers so assigned to coalitions can usefully be conceived as the “strength”’ 
of the coalitions. Certainly, the above example suggests that they cannot. 

Such remarks do not invalidate our earlier comment that, if one wishes 
to represent coalition strength in a conflict of interest by a single valued 
numerical function, it is intuitively plausible that it should satisfy the two 
conditions of a characteristic function. They do, however, raise serious 
doubts as to the formal procedure, now current, of obtaining these num- 
bers for non-constant-sum games. 

Unfortunately, it is also a question whether characteristic functions 
ever adequately represent a game, at least insofar as a descriptive theory 
isconcerned. Let us restrict our attention to zero-sum games. It will be 
recalled that the characteristic function is derived by supposing S and 
—S form opposing coalitions and v(S) is the value to the “player’’ S of the 
two-person zero-sum game. A theory based on this characteristic func- 
tion seems reasonable if it is supposed that, whenever S' forms, —S also 
forms. But does that not prejudge the theory by demanding that all 
Conflicts of interest always reduce to two opposing coalitions? Certainly, 
this is not observably true, and, to the extent that the formulation of the 
game situation demands it, the formulation is probably inadequate for 
social science. In actual fact, the several theories now current do not 
necessarily stipulate that the game reduce to two opposed factions. Yet, 
the calculations are based on the characteristic function, which means that 
the model assumes each coalition takes the most pessimistic view of the 
°pPosition it will face. The players of these theories are conservative in 
the extreme. Presumably, an adequate descriptive theory will incorpo- 
Tate the expectation of each potential coalition as to the reaction of the 
remaining players if that coalition should actually form. How these 
“xpectations should be calculated is far from clear. 

We shall not deny that we feel these two groups of criticisms are very 
Setlous indeed, and as a consequence we have limited faith in the ability 
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8.6 IMPUTATIONS AND THE CORE 


So far we have dealt with only one ingredient of the n-person game with 
side payments: the strength of the different coalition possibilities as meas. 
ured by the characteristic function. Distinct from this, though pre. 
sumably influenced by it, are the payments that the individual players 
finally receive. Since we have assumed payments in terms of unre 
strictedly transferable numerical utilities, the direct payments as prescribed 
by the normal form of the game and any side payments resulting from 
agreements arrived at during coalition formation can all be added up for 
oA ae es i this summary payment for player #; then th 
which we may write a n players forms an n-tuple of real numbers 
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of our principle of individual behavior [po tulate (ix). section 3.6]. We 
are therefore led to impose the condition of indi 
(i) v({z}) < x;, for every 7 in /,,. 

The second condition results if we use an analogous argument for the 
set of all players, but this argument is far less acceptable, as we shall see. 


It runs that rational players, no matter how they constitute themselves 
. | 
into coalitions, should not accept a total payment \ x; less than v(Jn), 


Pt med 
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for, if they did, each player could be made to gain without loss to the 
others. For example, each could be made to gain the arnount 


[ oz) ~* ma xi|/n. 


If this argument is accepted, we must then require that v(Jn) < > Xie 


tinIn 
Since v(/,,) represents the most that the players can get from the game by 
forming one grand coalition, it is impossible for x; to exceed v(J,,), so 


tin In 
this condition, Pareto optimality, amounts to: 


(ii) 2 Bie al), 
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Any n-tuple x of real numbers satisfying (i) and (ii) is called an impu- 
ation of the game with characteristic function v, and it is held that any 
equilibrium payment must be selected from among the imputations. 

The controversies about imputations are entirely restricted to the second 


condition. First of all, it is clear that if > x; is less than v(J,) each of 
tin In 
the players can be made to profit without loss to the others. Yet, it is by 
a means clear that players will be able to reach agreements effecting 
this. The argument leading to group rationality, i.e., Pareto optimality, 
18 an attempt to extend the postulate of individual rationality to groups of 
Players; however, the notion of group rationality is neither a postulate of 
n€ model nor does it appear to follow as a logical consequence of indi- 
a rationality. One might attempt to argue it as follows: Any impu- 
ation which does not add up to o(J,) will not be in equilibrium because 
individuals will see that their own position can be bettered, and, 
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iii) 0(S) < > x; for every SU 
[Observe that (iii) includes both (i) and (ii) as speci: 


It is hard not to say Yes; however, as We shall F : lead si trouble, 
So one is led to look hard for a defense to keep condition ( vhile dropping 
(iii). The following is a weak, but possible, argument Sup . 
imputation such that the sum of payments to players in S i less than v(S), 
The above argument was that the members of § would not be content with 
x because they can command 2(S) and therefore each can receive more 
than x; But suppose that the coalition —.S forms and threatens to dis- 
rupt § through attractive offers to one or more of its members if S does 
not accept the imputation x. Such pressures seem at least a possibility 
for all coalitions save J,,, for it can be threatened only by the empty set. 
Thus (ii), but not necessarily (iii), should hold, according to this argument. 
The set of n-tuples satisfying condition (iii) has been termed the core by 
Gillies [1953 b] for the very good reason that these n-tuples should be 
included in any definition of equilibrium we propose. The difficulty in 
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But from our previous argument, tl 


be an equality, hence it follows that 
If we choose § to be a single player, der We see v({2}) = "xE'80 We uaa 
shown that for every S 
x” 
v(S) = Ms (71) 


That is, we have shown that, if the core of a constant-sum game is non- 
empty, the game must be inessential. Therefore, if the core is taken as the 
definition of equilibrium, all essential constant-sum games lack equilib- 
rium n-tuples of payments. We may fairly conclude that we are in 
serious trouble if we accept the full consequences of the argument leading 
to condition (ii); up to the present we can find in the literature three ways 
to avoid or bypass these troubles. 

The most obvious way to avoid them is not to impose condition (ii) and 
therefore not condition (iil). This tack has not been fully explored, but 
as we shall see in section 9.7 Shapley and Gillies have examined the effect 
of dropping condition (ii) within the framework of the solution theory of 
von Neumann and Morgenstern, and they have established that for solu- 
tion theory it makes no difference. Nonetheless, since the solution theory 
was presumably devised as a way of keeping condition (ii) and bypassing 
condition (iii), one can raise the question whether an entirely new equilib- 
rium theory different from solutions should not be devised when condition 
(ii) is omitted. 

The resolution offered by von Neumann and Morgenstern, which 
retains condition (ii) but not condition (iii), involves the idea that it is 
not an imputation which is in equilibrium but rather aset ofthem. ‘These 
sets of imputations—which are called solutions—possess certain properties 
of inner stability which we shall discuss in the next chapter. 

The third major approach to bypass the difficulty of condition (iii) is to 
demand that it hold only for certain coalitions S. Luce [1954, 1955 a] has 
argued that, if the game model included appropriate sociological assump- 
tions, in general the contemplated changes from an equilibrium state 
would be restricted (see section 7.6) and that, therefore, condition (iii) 
fe soibave to hold for all coalitions S. Milnor [1952] in his sug- 
gested reasonable outcomes has, in effect, allowed condition (ili) to be 
Violated for certain coalitions S. These ideas are discussed in detail in 
Chapters 10 and 11. It must be emphasized that these resolutions of the 
difficulty say, in effect, that there are some significant restrictions in 
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(i) x; 2 0, for all ¢ in Tay 


and 


In other words, the set of imputations corresponding to the 0, 1 normalization 
is identical to the set of all probability distributions over the elements of Tig it, 


an imputation is a distribution to the individual players of the total payments 
received by all of the players. 


Tf x = (x1, x2, ° ++ , xq) is a probability distribution over J, then it is easy 
to show that the set function 
x(S) = Dx 
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the players are allowed and where utility act 

freely transferable in any amount up to the 3 

its numerical value whenever a transfer occurs 

central concepts are introduced: characteristic fi 
If a subset S of players forms a coalition, it 

for the remaining players, —S, also to form a 


two-person game, S versus —S, results. ‘The maximin value for 


Sie 


§ can be computed; let it be denoted v(S). These numbers, which are 


defined for every subset of players, form what is called the chara teristic 
function of the game. It can be shown to satisfy: 


(i) 2(¢) = 9, where ¢ is the empty set, 
and 
(ii) If R and S are disjoint coalitions, (RUS) 2 0(R) + o(S). 


By considering the effect of changing the unit of the characteristic func- 
tion and of making preplay payments to or from the players, we were led 
to a concept of strategic equivalence. Formally, two characteristic 
functions v and v’ on the same set of players are said to be S-equivalent if 


there exist a positive constant ¢ and constants aj, j= 1, 2,2 5 eR 
that 


v'(R) = co(R) + ) Qi, 


tink 


for every coalition R. This relation partitions the set of all characteristic 
functions on n players into non-overlapping subsets of equivalent charac- 
teristic functions. It is held that any equilibrium theory should yield 
corresponding results for all members of the same subset; thus, it is suff- 
cient to examine just one representative from each. ‘The one chosen for 
essential games satisfies v({7}) = 0 and v(I,) = 1, and it is called the 0, 1 
normalization of the members of its subset. 

These ideas capture something of the strategic potentialities of the 
several coalitions, but they fail to deal with the returns to individual 
Players. It was argued that the special assumptions about utility allow 
Us to summarize the total payment to player 7 by a single number x;. 
From the individual rationality assumption, it follows that 


(i) x; > v({i}), for each player 2. 


i was also argued that for the same reason the set of all players cannot be 
eae to accept less than the total payment available, i.e., the group as 
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(iii) b x; > v(S), for all coalitions S. 

tin S 
The set of imputations satisfying (iii) is called the core of the game 1, 
Many games, including all essential constant-sum ones, have a vacuoys 
core; thus, serious doubt is cast on the appropriateness of (ii). What 
happens to the several theories when (ii) is not assumed will be discussed 
as they are presented. 


chapter 9 


SOLUTION > 


9.1 THE VON NEUMANN-MORGENSTERN 
DEFINITION OF A SOLUTION 


In the published literature of n-person games one definition, based on 
characteristic functions and imputations, has received primary attention; 
this definition, introduced at length by von Neumann and Morgenstern 
[1947], was offered as the “‘solution” to the n-person cooperative game— 
indeed, it was given the name solution. Following their exposition, we 
May first suggest the idea by an example. It is not difficult to see that the 


0, 1 normalization of an essential constant-sum three-person game is 
unique, and that it is: 


(i) Baa) = o({2}) = v({3}) = 0, 
(ii) Hi 1, 2}) = v({1, 3}) = v({2, 3}) = 1, 
(iii) pat, 2 2or = 1. 


Wore: Properties (i) and (iii) are specifically required by the 0, 1 nor- 
— (section 8.3), and (ii) follows from these and the constant-sum 
ao Suppose, for the moment, that the coalition {1, 2} forms. 
es . to the characteristic function it may command a payment of 1, 
€ players 1 and 2 have symmetric roles (in the sense that if we 
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of F; rather he must offer something like (0, 34, +4 “ ee 
bers of the coalition receive equal incremental benefits. ; eunce both wou d 
benefit, the change should occur. We say that (0, 34, 14) a 
(%;, %, 9) with respect to the coalition {2, 3} because both 2 and 3 hot 
fit and the imputation can be enforced. It is easy to see that for any 
imputation in F there exist imputations not in F which are better for two 
individuals and which can be enforced by them if they form a coalition, 1¢, 


each imputation in F is dominated by one or more imputations not in F. 
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Clearly the argument which led to calling F stable is dynamic in nature, 

and yet we conceive of game situations as one-shot affairs. How, then, 

can we possibly formalize such reasoning? ‘I'he answer 1s that we do not; 
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we only characterize those sets of imputations which ultimately possess 
the inner stability relative to the notion of domination. It should be 
remembered that, although the game itself is played but once, there is 
nothing which prevents the pregame negotiations among players (which 
are not now formalized in game theory) from having such a dynamic 
quality. 

Yon Neumann and Morgenstern give the following definitions which 
generalize those of the example. Let the game have characteristic func- 
tion v; then an imputation y = (y1, y2, °° * » Jn) dominates an imputation 
x = (x1, x2, °° * , Xn) with respect to the coalition T provided that 7 is not 
the empty set and the following conditions are met: 


and 


(ii) y; > x, for every i in T. 


The first condition admits y as dominating x only if y is feasible in the 
sense that the members of 7 can expect to have the amount prescribed by 
y to distribute among themselves. The second condition says that every- 
one in T strictly prefers y tox. The set 7 in such a domination is called 
an effective set. 

Observe that exactly this condition was met when, in the example, we 
said that (0, 34, 14) dominates (14, 14, 0) with respect to the set {2, 3}. 

€ say that an imputation y dominates x (without specific reference to 
the effective set) if there is at least one effective set T such that y domi- 


pales X with respect to J. It is not difficult to see that all the logically 
Possible cases can arise, namely: 


ae dominates x, but x does not dominate y. 
7 Yy dominates x, and x dominates y (with respect to non-overlapping 
es Itions, of course); e.g., in any five-person game with o({1, 2}) = 
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i, If x and y are imputations in A, then neither x nor y dominates the 
other. 

li. If z is an imputation not in A, then there is at least one imputation 
x in A which dominates z. 


It must be emphasized again that the definition of solution in no way 
precludes the existence of imputations not in A which dominate one, 
or several, of the imputations Memeneare in A. This occurred in the 


example we discussed, We shall return to this point, which is not with- 
out complications. 
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Now, there is a possible converse to the theorem that dominance is preserve¢ 
fer S-equivalence, namely, that, if two games have imputation spaces whi h 55 
ae phic under domination, the two games have characieristic functions which 
Ee apeqoivelent. This much more subtle result has been McK 
1950 b], to hold for zero-sum games. P, 
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9.2 SOME REMARKS ABOUT THE DEFINITION 


Before discussing the mathematical results which have been obtained— 
and some which have not—concerning solutions, certain questions about 
the intuitive adequacy of the definition must be considered. The notion 
of “dominance with respect to a coalition 7”’ is the conjunction of two 
quite distinct notions: The first condition, v(T) 2 ve yi, formalizes the 

iin T 

idea that “‘y is ‘feasible’ with respect to 7,” and the second one, yi > *i; 
for all i in T, states that “‘y is ‘better’ than x so far as the members of 7” 
are concerned.” Of these two, there seems little reason to question the 
second but the first is open to some debate. The criticisms of feasibility 
are not so much an objection to the definition of a solution itself as to the 
representation of a normalized game by its characteristic function—a dis- 
cussion begun in section 8.5. These objections evaporate completely in 
the special case where only the characteristic function is given, where the 
payoffs are assigned to coalitions, not to individuals. 

For zero-sum games, it can be argued that if the coalition T forms it can 
never enforce more than v(T) since the remaining players, who are 
assumed to be rational and unconstrained by any social limitations, will 
certainly form the coalition —7. In other words, for zero-sum games 


and unlimited collusion, an imputation y with o(7) < ) yi is certainly 
iin T 
not feasible. If, however, we drop either of the conditions—either zero- 
sumness or unlimited collusion—then the condition of feasibility is subject 
to doubt. Furthermore, descriptively it is doubtful even in the zero-sum 
Case, for the coalition may realistically count on the remaining players not 
'o agree on a division of v(—T) and so not to form the coalition — 7, 
pebider, then, the role the solution concept may play. A descriptive 
nga be concerned with economic, military, and social conflicts of 
oie and certainly not all of these reduce to the opposition of two 
el St Yet if limitations on collusive arrangements prevent the 
aes pepeen rd: from forming, there is no a priort reason why the 
ies. et be limited to receiving 2 Ts Thus, it is questionable 
ei : theory of solutions can possibly be descriptive; of course, it 
con te. aya normative role. But, as we pointed out, the feasibility 
$ subject to doubt for non-zero-sum games. It is perfectly 
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It should be noted that solutions are only concerned with Imputations 


and that they do not specify the coalition structures associated with each 
of the imputations, or even the set of all possible coalition structures associ. 
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_ anipae a priori Criticism is severe and not wholly idio- 
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q countab 
g solution below. om * 

The second main point is that, aside from thx 

‘n a solution, most games that have been stud 
This possibility was suggested by our earlier observations that 
be imputations not contained in a solution which dominate imputation: 
of that solution and that the relation of dominance is not transitive 
Even in the three-person zero-sum game the solution F is not unique: I 
we choose ¢ to be any fixed number in the range 0 < ¢ < 4, and if we 
let x; and x2 be chosen so that neither is negative and x, + xe+tc = 1, 
then the set of imputations of the form 


Je infinity— of imputations.’ We sha! 


(x1, X25 é), 


where x1 and x2 vary, forms a solution for each value of c. We shall 
denote this solution by F3(c), where the numbers 3 and ¢ indicate that the 
fixed amount ¢ goes to player 3. Equally well, the two sets of imputations 
obtained by moving ¢ to player 1 and to player 2, i.e., the sets 


(c, x2, x3) and CORA? 


are also solutions, which we shall call Fi(c) and Fo(c). Since there are 
solutions for each possible value of c in the interval from 0 to 44 (but not 
including 14), there is a continuum of solutions, each of which contains a 
Continuum of imputations. Indeed, every possible imputation for the 
constant-sum three-person game is included in at least one solution! 

Therefore in the case of the essential three-person game we have an 
embarrassing richness of solutions.” [McKinsey, 1952 a.] 

This abundance is not restricted to the three-person case. 

The immediate question is how to interpret these solutions. Von Neu- 
mann and Morgenstern divide the discussion into two parts. First, they 
7 that, of the several solutions, the one which is accepted depends upon 

Standards of behavior” which are moral or conventional rules imposed 
oak Thus, they say, if society accepts discrimination, one may 
i es of the type F;(c) where the position of c in the range 0 to 14 

a by the degree of discrimination tolerated by the society. 

‘ oe ¢ fixed, there is a question how the other two players will divide 
ie as 1 —% and this is a problem in bargaining which depends 

relative bargaining abilities of the two players. They do not 


1 A A 
oe Se Contains a countable infinity of elements if we can enumerate them, i.e., if 
integer Peak of a first, a second, and so on in such a way that each element has an 
reads sociated to it. Shapley [Kuhn, 1953 a] has conjectured that no solution ever 


4 countable infinity of imputations. 
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And 


| A] in its entirety 
* + + the above considerations make it even more clear that ae sadi- 
is a solution and possesses any kind of Btability—but aes hat several solu- 
vidually. The circular character - - - makes it plausible also a ee 
tions [A] may exist for the same game. I.e., several stable ; 7 pane 
may exist for the same factual situation. Each of Sag wo Ss foie 
stable and consistent in itself, but in conflict with all others. [ »P 


The full flavor of their argument is hard t 
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9. 

eller and two buyers. This example is particularly suited to illustrat 
urposes because it is sufficiently simple to be analyzed b 

ense”” economic argument. Its game theory solution has been presented 
S . ay 2 gine FS 

by von Neumann and Morgenstern [1947, pp. 564-573), anc lso 


e 


possible to analyze it in detail in terms of the other theories which have 
been offered (Chapters 10 and 11). A comparison of the common 
aspects of these several analyses, and of their differences, is instructive. 
Let the seller be called player 1 and the two buyers 2 and 3. We shall 
suppose that the seller is in possession of a single indivisible commodity 
which he is willing to sell for a price. Furthermore, we shall suppose that 
among the players there is an infinitely divisible and transferable com- 
modity, which we shall call money, in terms of which the object is priced. 
Let the players 1, 2, and 3 value the object a, 6, and ¢ units of money 
respectively. There is no loss of generality if we assume that b < c, and 
we may as well assume that a < 5, for otherwise player 2 is not really a 
part of the market (since, if a > 6, he values the object less than the 
person who already possesses it), and so there would be no point in treat- 
ing it as a three-person game. Because of the different valuations of the 
object, it is clear that this is a non-constant-summ game. The question to 
be answered by any analysis we perform is what coalitions, if any, will 
form and what exchanges of money may be expected to take place. 

Let us first determine the characteristic function of the game. Since 
player 1 has the object and is not forced to sell, he can guarantee himself a 
value of a—the value he places on the object. Since the other players are 
not forced to buy he cannot be certain of a value greater than a, so 
o({1}) = a, Equally well, since the other players do not possess the 
object and cannot be certain of getting it, they cannot be certain of a value 
in excess of 0; but since they are not forced to participate in the bargaining 
they can be certain of 0, so v({2}) = v({3}) = 0. If the coalition {1, 2} 
Were to form, then the object would be in their possession and its worth to 
Player 1 is a and to player 2 it is b. ‘The object cannot be removed from 
the coalition by a payment less than 6, for, if player 1 were tempted to 
sell it for less, player 2 would pay him that much and continue to keep the 
object in the coalition. On the other hand, the coalition cannot assure 
Mself a value greater than b, so v({1, 2}) = 6. Similar arguments show 

t the value of the game to {1, 3} isc, to {2, 3} it is 0, and to all three 
Players it isc. So, in summary, the characteristic function (not normal- 


ized) is: 
v({1}) 
v({1, 2}) = 


I 
Sy 


,  a({2}) = a({3}) = 9, 
» w({1,3})=«4  ({2, 3}) = 0, 
v({1, 2, 3}) =. 


~ 
a er ett rere paenatie 
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The first “theory 
analy: 
buyer will be included 
the stronger buyer, pla wil 
in which case either can participate. 
the commodity player 1 can get at least b, and since player 3’s limit ;. 
he can get no more than that. If c > b, player 3 can always exclud. 
player 2 from the bargaining by paying something more than 4 for the 
object. Thus, the only imputations which may arise, if this argument j 


valid, are those satisfying: 


BS xy Sc, x2 = 0, x3 =C¢C— x. (1) 
As we shall see, these imputations are very important and common to 
all the theories; the several theories differ in what other imputations they 
include. 
ata theory we shall consider is that of the core. It will be 
“See Gee 6:6) that when the argument used to limit n-tuples to 
ee, - te carried to its logical conclusion (assuming no limitations 

rm i : ae 
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9.5] 


sens 


definition © 
These extra imputations presumably arise when players 2 and 3 form a 


coalitio : ie 
t location of the price between a and b depends upon their bargaining 


e argument; presumably, the reason for the difference is that the 


f a solution takes into account the possibility of coalitions. 


n and agree not to pay more than 4 for the commodity. The 
exac 
ability relative to player 1. Having bought the commodity, the problem 
of dividing the spoils remains—the one who keeps it must pay the other 
for his cooperation. The exact price, in terms of the selling price x, is 
given as one-third and two-thirds of ¢ — x. Clearly, this function is 
highly special, and there must be other solutions with different schemes for 
splitting up the payment. Actually, it can be shown that, if f and g are 
any two monotonic decreasing functions of x such that 


f(x) 2 0, g(x) 2 0, x + f(x) + g(x) Sy Gs 


for a < x < b, then the imputations of the core (eq. 1) plus those of the 
form 


(x, f(x), g(x)), 


where a < x < b, forma solution. Furthermore, all solutions are of this 
form. The core alone does not form a solution to this game because 
there exist imputations outside the core which are not dominated by any 
member of the core. However, the core must be in every solution, for if 
it were not then the imputations in the core, i.e., the undominated impu- 
tations, would fail to be dominated by an element of the solution, which is 
impossible. (Shapley [Kuhn, 1953 a] has conjectured that the intersec- 
tion of all solutions of a game is the core, but this has not yet been proved. 
Gillies [1953 6] has given a necessary condition for the core to be a solu- 
tion.) Now, according to von Neumann and Morgenstern, the choice of 
the pair of functions f and g, which determines the division of the spoils in 
s of the selling price, and hence the choice of the solution, depends 
on the “standards of behavior’ of the society. Put more colloquially, 
ends upon the “going” price for this type of cooperation. 

might be tempted to inquire why the interpretation of the solution 
$ mention of only the coalition {2, 3}. Although we could discuss 
, it will be more illuminating to reserve our comments until later 


1 10.3), 


URTHER RESULTS ON SOLUTIONS 


$s a considerable literature on solutions that presents partial 
| results about solutions for various classes of games. We do 
examine these theorems in great detail, for that would 
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As we already know, for the three-person i 7 
a plethora of solutions; this also appears to = the Case | 
four- and five-person constant-sum games. ¥ et, with this 2 
of solutions for games with small n, it is still not known if eve 
possesses a solution. For example, it is not even known if every { 
person game has a solution. From the first systematic presentation , 
n-person game theory to the present, the existence of a solution to eve; 
game has been considered the most important unresolved problem. The 
reader is referred to discussions in Kuhn [1953 a] and Wolfe [1955] for 
Oe Neumann’s views on this problem and his suggestions as to Feséarch 

irections. 
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9.5] 


We may fairly c 
: : : ; hfs ; ai 
mentioned in section 9.2, there are mathematical difficulties surrounding 


“ solution notion, or, at least, the mathematical problem is difficult. 
At this stage it is not clear whether this will stimulate deeper insights into 
i. concept or whether it will prove so discouraging that little more will 
be discovered about solutions. 

Assuming that at least some people will be discouraged, there appear to 
be two other possibilities: (1) to single out some of the more regular solu- 
tions as more important than others and to study only these, and (2) to 
introduce new concepts more or less in competition with the solution 
notion. In section 9.6 we shall deal with an example of the first approach 
and in Chapters 10 and 11 with several examples of the second. But, 
before pursuing these topics, a more technical summary of the solution 
jiterature will be given for the mathematically oriented. 


onclude that, in addition to the conceptual difficulties 


> In attempting to prove that solutions exist for all games, one possible procedure 
is to treat the domination relation over the space of imputations as an abstract 
relation over a set of points. The definition of solution can be given in this con- 
text without reference to the fact that, in game theory, the points will be imputa- 
tions, The problem then becomes one of finding conditions which ensure the 
existence of a solution. In von Neumann and Morgenstern [1947] it was shown 
that, if a relation meets a condition, which we need not specify here, then a solu- 
tion exists and it is unique; however, the condition is much too strong to be of 
general interest in the theory of games. Richardson [1946, 1953 a, 1953 6, 1955] 
has pursued this direction much further and has shown the existence of solutions 
under different and fairly weak assumptions, but to date he has not obtained a 
theorem which guarantees a solution to every game. 

Earlier we mentioned very briefly the example produced by Shapley [1952 c], 
Which shows how irregular solutions may be. The solution he presented is based, 
Mm part, on an arbitrary closed set C of an (n — 3)-dimensional subset of the space 
of Imputations. This paper “‘ - - * provides at one stroke a large fund of ‘patho- 
logical’ examples against which conjectures on the bahavior of - * - solutions 

te be tested.” [1952c, p.1.] ‘The arbitrariness in the choice of C (for exam- 
- ple, C may be a Cantor-type discontinuum) makes it easy to dispose of many con- 
mea” concerning the regular behavior of « °° {solutions].”” [1952 c, p. 2.] 
____~he remaining papers on solutions are all concerned with the (partial) charac- 
Zation of the solutions of specific classes of games. 
tis known that every four-person constant-sum game has at least one solution, 
the solutions of a few of these games have been studied in detail. Von 
1 and Morgenstern [1947] showed that these games can be put into 
correspondence with the points of a cube, and they investigated some 
games in considerable detail; however, the only ones for which they gave 
utions are those corresponding to the vertices of the cube. Later Mills 
mined all solutions corresponding to the edges of the cube, and he has 
concerning the faces. In the area of general-sum four-person and 
1 five-person games, Nering [Kuhn, 1953 a] has found some solutions. 
of games for which detailed results are known are the simple ones, 
property that (in 0, 1 normalization) either v(S) = 0 or 1 for every 
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Bott [1953] introduce 
fe asi These are defined by the property 
0 if |S| < k 


us) = {4 if |S] > &, 


where > n/2 and |S| denotes the number of players in 5. In this paper he 
examined the symmetric solutions. In [1953 a], Gillies studied the non-sym- 
metric or discriminatory solutions to such games. In the words of Kuhn and 
Tucker, “Dropping symmetry, D. B. Gillies exhibits * * * a surprising variety of 
other solutions of (n, k)-games, all derived from Bott’s symmetric solutions. 
Gillies’ solutions are obtained by several methods which may carry over to a more 
general context: (1) by the addition of ‘bargaining curves’ (von Neumann and 
Morgenstern [1947], p. 501), (2) by inflation to larger games (:bid., p. 398), (3) by 
‘discrimination’ (tbid., pp. 288-289) in which the non-discriminated players 
divide their take according to any solution to a smaller game, or (4) by partitioning 
the players into fixed subsets, assigning the spoils arbitrarily (i.c., in all admissible 
Hew é one ete) among these subsets, and then dividing the spoils in any one 
th nab Se the symmetric solution to a smaller game the players think 
- $e playing.” [Kuhn and Tucker, 1953, p. 304.] 
third class of games which has received attention was defined by Shapley 


[1953 a). A quot : = Rati 
5 ESS Sehwag ls one for which it is possible to find n numbers , “2; 


o(7,) = @1 + we + eae + W, 


o({i, j}) = 0; + w;, for all i and Jot Fj. 


the entire class of quota games, a lass 
» all constant-sum four-person games, and 
han four players. In a typical imput™ 
two or three of the players receive the 


9; Pp. 304-305.] Kalisch [Kuhn, 1953 ah 
1 


‘ , € solutions, a] 
u 2 all 

hs period nat am Tucker, 195 

results t : 

there exists an j © what he calls m-quota games, i.e., games for whic 


mputation @ — 
(1, We, ++ ®@n) such that v(S) = : oy; for 
- m members, 3 
3 €rences s 
iS toed upon the Da id - 
8 to sy. ts 


all Coalitions if wi 


mentioned. A game is symmetric if (8) 
se in S. Gelbaum [Kuhn, 1953 4] s 
hy , otmes. Shapl charac’ 
‘ rt Ol gam : apley [Wolfe, 1955] has a 
in the soluti xd interesting featu : arrting from a consideration of certain econom* 
ion which can OEE: of ¢ Study is the existence of a phenomen” 

preted 


as the market price. The summaries 


Strong Solutions 213 


9.6] 

he two Conferences on Game Theory, held at Princeton in 1953 and 1955 [Kuhn, 
e wie we ‘ ns 

1953 a; Wolfe, 1955], contain, in addition to reports of specific results, some dis- 


he solution notion and the opinions of various workers about the future 
direction of the theory. Finally, many of the papers referred to above which 
either are unpublished or have only appeared as technical reports will in fact 
appear jn print at about the same time this book is published; see Luce and 
Tucker (1958]. Our reason for not using this reference is that their table of 
contents was not settled at the time we were writing. > | 


9,6 STRONG SOLUTIONS” * 


The principal question to be discussed in this section is whether, aside 
from “standards of behavior,” there are game-theoretic requirements 
which impose a greater stability on one solution than on another. This 
problem and the ideas here discussed were raised by Vickrey in some 
unpublished work [Kuhn, 1953 a; Vickrey, 1953]. 

The central idea is this: An imputation in a solution is likely to be 
adhered to if and only if any contemplated deviation to an imputation 
outside the solution will invite a corrective action, leading back into the 
solution, which results in a net loss to one of the players effecting the 
deviation. It will be recalled (section 9.1) that the solution 


F = {(14, 4, 0), (4, 9, 14), (0, 74, 72)} 


of the three-person constant-sum game has the property that, although 

the imputation (0, 34, 14) dominates (14, 4, 0) of F, it is in turn domi- 

nated by (14, 0, 14) of F, and the net result for player 2 is to pass from a 
_ payment of 14 to one of 0. 

__ With respect to a specific solution A, an imputation in A is called con- 
forming; one not in A, non-conforming. Among the non-conforming impu- 
tations some dominate one or more conforming imputations; these are 
Called heretical imputations, and an effective set for such a domination is 
da heretical set. Vickrey discusses the example of the solution F as 


this case the movement to a non-conforming imputation * - - requires 
ration of player 2, who though he may gain immediately, finds that 
may have been difficult to move from (14, 14, 0) to (0, 34, 14) it is now 
ier for the couple {1, 3} to organize a movement to the conforming 
4, 0, 14) to the great discomfiture of 2. - « - If 2, finding himself 


‘the remainder of this book depends upon the material in this section; 
, not been starred because it is largely conceptual and the level of diffi- 
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One might hope that all solutions would have this propert 
would serve as an added rationale for accepting them as a descripti 
stable social pattern, but they do not. To show this, we consider one 
the discriminatory solutions of the three-person constant-sum gam 
Suppose we take F3(14), which consists of all imputations of the forn 
(x1, x2, 14), where x1 + x2 = 34 and x Z Oand x2 2 0. Suppose tha 
the players are at the imputation (7 9, 242, 14) of this solution. The non- 
conforming imputation (0, 44, 14) is heretical since it dominates the given 
imputation of F3(14), the effective set being {2, 3}. Of course, it is 
dominated by a conforming imputation, for example, by (2/2, 12, 14), 
the effective set being {1,2}. But, observe, the net effect of the heresy has 


been a temporary gain for player 3 and a permanent gain of {9 for 
player 2. 
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tion for a _ soe 
imputation of the solution. It is important to recognize that the heretic 
who will be punished is not in general uniquely determined, which of 
course means that the argument against heresies occurring is much weaker 
than it would otherwise be. On the other hand, A is said to be weak if for 
every x of A there exists at least one heretical y with effective set T such 
that for all z of A which dominate y, and alli in T, 2; 2 x. 

For the constant-sum three-person game, as we have suggested, the 
symmetric solution F is strong, and all discriminatory solutions F;(c), 
j = 1, 2, 3, are weak. 

For games with more than three players there are solutions which are 
neither strong nor weak, but rather there are intermediate notions of 
strength. Primarily, however, one is interested in the strong solutions, 
for which every heresy is dangerous to some member of its heretical set. 

At present, the only procedure available to determine the strong solu- 
tions is to determine all solutions and examine each one separately. Thus, 
Vickrey has been restricted to studying such cases where all the solutions 
are known, i.e., to some four-person and some simple games. In sum- 
mary, he finds that 


t least one of the heretics once it is corrected by a dominating 


For constant-sum games, the concept of the strong solution has thus far appeared 
to be fairly effective in narrowing down the number of solutions that have to be 
accepted. When it comes to the variable-sum games, unfortunately, it appears 
that much of the selectivity of insistence on strong solutions disappears. For one 
and two person games, all solutions are already strong, while for three person 
games, it appears that insistence that * * * solutions be strong offers only a rela- 
tively small reduction in the range of possible imputations. (1953, p. 32.] 

: No attempt has as yet been made to try out the effect of insisting on strong solu- 
tons for variable-sum games for more than three persons, so there is no way of 
telling whether the concept would prove more restrictive in such cases or not. 
The complexities and variations possible between the extremes of strong and 
weak solutions already observed for the four-person constant-sum game indicate 
that the analysis of such games may prove to be extremely difficult. On the basis 
of the experience with the three-person games, one is inclined to be not too san- 
eine. The strong solution, that appears to be such a potent device for the 
simplification of the results of constant-sum games, may, it appears, be of rela- 
tively little value for the variable-sum games, although this tentative hypothesis is 

fdly more than a conjecture. [1953, p. 35.] 


ee *).7 SOLUTIONS OVER DOMAINS DIFFERENT FROM IMPUTATIONS 


. When the concept of an imputation was first introduced (section 8.6) 
question was raised whether it was not too restrictive; in particular 


Second condition, v(In) = > x;, was challenged as not following 
iin In 
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Shapley [1952 6] and Gillies [1953 b] have isolated the following fo, 
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xi < o(I,). 


tin In 


E is the set of n-tuples x such that 
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+ No element in A dominates another element in A. 

Is 


« Byery element of C not in A is dominated by some element of A. 
ii. 


An stable set is therefore another way of speaking of a von Neumann- 
Morgenstern solution. 

There are four theorems and several examples which establish the rela- 
tions among the C-stable sets for the four classes of outcomes which have 
been defined. First, a set A is E-stable if and only if it is Z-stable [Shapley, 
1952 b]. That is to say, if one is concerned with stable sets, then it is 
immaterial whether one chooses E or E as the set of n-tuples, for no E-stable 
set has outcomes lying outside the set E. Thus, with respect to theories 
of stable sets where we do not choose to impose the condition of individual 
rationality, there is no loss of generality in imposing the condition of group 
rationality. However, as we have made clear earlier, the condition of 
individual rationality appears to follow directly from the basic postulates 
of game theory, so our real concern is with the sets J and J. Gillies 
[1953 6] has shown that a set A is /-stable if and only if it is -stable. Thus, 
when the condition of individual rationality is imposed, there is no loss of 
generality so far as solution theory is concerned in the further restriction 
to imputations. This theorem eliminates insofar as solution theory is 
concerned any a priori objections (such as those discussed in section 8.6) 
to restricting attention to imputations, but, of course, it does not obviate 
our remarks of section 8.6 that the solution notion is a means of bypassing 
the logical extension to all coalitions of the intuitive argument supporting 
§roup rationality. Since the solution notion, or more generally that of a 
C-stable set, is not clearly acceptable, it may be desirable to create an 
entirely different theory—one in which the condition of group rationality 
may not be immaterial. 

mi _ In the quotation given at the beginning of this section, Shapley points 
out that it would be interesting to know whether or not the notion of domi- 
ot between outcomes implies individual rationality. Given the above 
s, this reduces to an examination of the relation between J-stable and 
sets. He has shown that a set A which is E-stable is also J-stable 
Only if every outcome in A is an imputation, i.e., if A is a subset of J. 
‘Teverse direction, he has shown that if A is a solution, i.e., an J-stable 


x in A such that x; = v({7}). These results strongly suggest 
ble to have E-stable sets in which some, or indeed all, of the 
hot imputations, i.e., in which individual rationality is 
y also suggest that there may be solutions which are not 
y has exhibited examples of both possibilities (an 
quota game with a weak player which includes no 
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9.8 SUMMARY 


Solutions, the central equilibrium concept for games in characteristi 
function form, has been our topic throughout this chapter. An imputa- 
tion y is said to dominate another imputation x with respect to the coalition 1 
if it is both feasible, i.e., 


() (7)2 Ys 
iin T 
and better than x for T, ice., 


(ii) y; > x, for every i in T. 


y simply dominates x if there is some T such that it dominates x with 


respect 
i ct to rT. In terms of these concepts, a solution is defined to be a s¢t 
of imputations such that: 


1. If x and y are in A, neither x n 


il. For any imputation z not in A 
x dominates z, 


or y dominates the other. 
there is at least one x in A such that 
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Morgenstern argue that these eee questions beyond the game framework, 
that the former depends upon ‘standards of behavior” in society and the 
Jatter Upon the “bargaining abilities’’ of the players. 

These difficulties and criticisms notwithstanding, the concept gives 
considerable insight into at least some situations in which coalition forma- 
tion is allowed. ‘This was illustrated by the simple example of a three- 
person market. 

Vickrey’s concept of a strong solution is the only attempt to narrow 
down the number of solutions to be considered. Roughly, a solution is 
called strong if the sequence—an imputation in the solution, a change to 
a non-conforming imputation, and a return to an imputation in the 
solution—always means that at least one of the players participating in the 
original deviation ultimately suffers a net loss. Thus, a strong solution has 
an inherent stability not possessed by other solutions, and so it might be 
expected to occur rather than one of the weaker solutions. There seem 
to be relatively few strong solutions in constant-sum games, but the restric- 
tion is not so effective for non-constant-sum ones. 

Earlier we raised a question about restricting our attention to imputa- 
tions, for there seems to be no valid argument based on individual ration- 
ality for imposing group rationality. In the final section we examined 
results concerned with this problem for solution theory. The central 
conclusion is this: so long as individual rationality is imposed, no loss 


of generality results in solution theory from the further restriction to 
imputations. 


chapter 10 


W -STABILity 


10.1 Y-STABLE PAIRS 


: : -e are three 
Aside from the solution concept and its ramifications, = Sibel 
other topics in characteristic function theory which have yea We 
tion. Although two of these (y-stability and reasonable ov ils 
tinue to be concerned with outcomes which can be argued to be = : es 
all three notions differ appreciably from the solution concept. . fi me ie 
ple, one of the salient differences of the definition we shall ee 
chapter is that it does not deal with imputations or sets of impute 
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from consi 
their alliance structures. Roughly 

form of “friction.” If such is the case, the 
formalizing these limitations on coalitio1 
model is both somewhat realistic and at the 
tractable. One proposal—the one we shall 

the latter part of section 7.6, namely, that these lu 


rule y of admissible coalition changes which state: 


dering at one moment any but the simplest 


the players into Coalitions those (admissible) sets of players wh an take 
joint action. Since the following material rests heavily upon this notion, 
it may be advisable to reread section 7.6. 

Assuming that such a function describing the limits of coalition change 
is given, Luce [1954, 1955 a, 1955 b] has attempted to construct a theory 
for games with side payments which parallels in spirit both the suggested 
approach to a cooperative theory when side payments are prohibited 
(section 7.9) and the definition of the core (section 8.6). In this approach 
one searches for equilibrium outcomes where, as we mentioned, one part 
of the outcome is the arrangement of the players into coalitions as given 
bya partitions. Although one might like to use the strategies employed 
by the players as the other part of the outcome, these will not work here 
any more than in solution theory, for they do not tell what side payments 
were effected, and hence do not tell the total payment to each player. 
Thus, we are led again to use imputations rather than strategies as a 
description of the second aspect of the outcome. ‘The task, therefore, 
becomes one of describing those pairs [x, 7], where x is an imputation and 
7 is a coalition structure, which are in equilibrium when the game is 
described by its characteristic function v and when changes in collusive 
arrangements are limited according to a given function y. 

Suppose that [x, 7] is a candidate for an equilibrium pair, and let S 

be any of the coalitions in y(r), i.e., S is any of the coalitions which might 
form if the players involved so desired. Suppose that the players who 
would constitute the coalition S add up the total amount they are receiving 
in the imputation x and find that this sum is less than the total amount the 
Coalition § may expect to receive, i.e., 


xi < v(S). 
iin S 
Then they have a distinct motive to change from their arrangements in 7, 
Whatever they may have been, to form the coalition S; for by employing 
€ payments—and this is their crucial role—each player of S can receive 
____ More than he is assigned in the imputation x. Such a disruptive force is 
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We observe that this is exactly the same condition 9 cl : hold 
for all coalitions if we accepted the logical consequence ine 


for group rationality needed to limit payments to perputa tio ion 8.6) 
i i here it is not imposed for all co: tt oal 

The only difference is that here | | 
for those determined by the coalition structure 7 and the given function y. 
A second condition seems plausible if the pair [x, 7] is to be in equilib- 


rium. A player who participates in a non-trivial coalition (one having at 
Jeast one other member than himself) must thereby benefit in the sense 
that he must receive more than he can assure himself under the most 
adverse circumstances, namely, opposition by a coalition of all the other 
players. This is, in a sense, an extension of the argument, based on indi- 
vidual rationality, which led to the condition x; 2 v({z}) for any equilib- 
rium payments. It is now argued that not only must this condition be 
satisfied but, if 7 is in a non-trivial coalition, then x; > v({z}). 

These two conditions are taken to characterize the equilibrium pairs, 
so in Surnmary we make the following definition: A pair [x, r], where x is 
an imputation and Tf is a coalition structure, is ¥-stable for the game with 


ch . . . . i 
aracteristic function v and given “boundary condition” y if the following 
two conditions are satisfied: 
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hange (see section 7.6) from every coalition structure if 

y''-stable pair of the form fetes gaye lap, then 

empty. if, es : 
If the characteristic function v and the boundary condition ¥ 


unspecified, as in the definition, little more can be said about the exist 

and nature of y-stable pairs. If, however, certain spe ific functions 4 
are chosen, then it is reasonable to expect some theorems to hold tor varl- 
ous special classes of characteristic functions. So far in the literature only 
one class of functions y has been examined; it can be roughly described 
as including some of those functions with the property: S is in (7) pro- 
yided that there is a coalition T in rt which is not “‘too different” from S. 
For these functions certain general theorems do hold. At the beginning 
of the next section we shall give the precise definition of this class and 
state some of the theorems; the casual reader may, if he chooses, skip over 
this material, which is in small print, to the discussion of the faults and 
virtues of the y-stability concept. 


10.2 CRITICISM 


As background for a critical discussion of y-stability we shall present a 
summary of some of the theorems which have been proved: 


DA special class of boundary conditions must first be defined. Let k be a fixed 
integer lying in the range 1 to n — 2, where zis the number of players. We shall 
also denote the function we are about to define by f, i.e., for any 7, k(r) will denote 
the set of admissible coalition changes from 7. The coalition S is included in 
K(r) if and only if there is a coalition T in 7 such that the sum of the number of 
players in § who are not in T and the number of players in T who are not in $ 


does not exceed the integer k. Put in symbols, Sis in k(r) if and only if there 
exists a T in r such that 


Ws — NUT - | =|8— TIF IT-S1< 6 


where |S| denotes the number of players in S.!_ If we take the point of view that 
is a basic coalition which is modifying “‘itself’ by adding and subtracting 

Players, we demand that the number of players expelled from T plus the number 

of new ones added in order to form S shall not total to more than s players. When 

We are using this function we shall, of course, speak of k-stability. 

a oe the following discussion we shall not admit the coalition of all players as a 

sa € coalition structure. Under that restriction, the function (n — 2) is the 
i the function which admits every coalition as a possible change. 

ae not difficult to see that increasing the size of k admits more coalitions as 

k € changes, and so, if k’ > k, a pair which is k-stable may very well not be 


, 

Stable; but a pair which is k’/-stable is surely also k-stable. 

1 
mi and T are sets, S — T denotes the set of elements in S which are not in T, 
Sand 


= set of elements in S or in T or in both, and S/)\T the set of elements in both 
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called losing; those with v(S) = 1, winning. Theorem: A sil , 
if and only if either there is no winning coalition of k + 1 mem here ig.as 
least one player who is a member of all such winning coalitions Ww consider 
the three-person constant-sum game: It is simple, and all two-« coalitions 
are winning; since no player is common to all of them, the game is 1-unstable. 
More detailed results on the 1-stability of simple games are known. For 
instance, it is possible to describe the form of the 1-stable pairs, although that 


description is too complex to present here. More interesting and easier to 
describe are those simple games which are 1-unstable. To do so, we shall need 
the concept of a decomposable game, which is one consisting of two non-interact- 
ing games on complementary sets of players. To be precise, if there exists a set 
of players T such that for every coalition a 


u(S) = o(ST) + o(S — i). 
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The study of four-person constant-sum games 
becomes simply a matter of specializing the above results. MV 
constant-sum condition that they are all 2-unstable, but only those wit! 

Javer are also 1-unstable. For those with no weak player, it is not very difficult 
to show that the only imputation which can arise in a 1-stable pair is the quota. 
The pair io, (11},. {2}, 13}, {4})] is always 1-stable in such games, and, for 
most of them, it is the only 1-stable pair; however, there are certain games in 
which other coalition structures combined with w are also 1-stable. Gq 


Before criticizing the concept of y-stability, let us make certain sum- 
mary comments concerning its virtues. Mathematically, the more 
restricted concept of k-stability is comparatively easy to work with, much 
easier, say, than the solutions of von Neumann and Morgenstern. Evi- 
dence of this is the fact that k-stability results are known for all constant- 
sum four-person games, for all simple games, for all symmetric games, and 
for all quota games; whereas, to date it has been possible to obtain informa- 
tion about solutions only for special cases of these classes of games. In 
cases where results are known from both theories, the k-stability results 
are generally the simpler to state but are still not intuitively obvious. 
Although the notion is mathematically plausible, for social science more 
is needed: the definition must have some intuitive merit and, possibly, some 
empirical usefulness. ‘There are two points in its favor conceptually, and 
the first may also be valuable empirically. First, the state of equilibrium 
is described both in terms of an imputation and a coalition structure; con- 
ceptually, both seem to be relevent. Empirically, it is often easier to 
determine the coalition structure of an existing situation than to deter- 
ae a made to the players (see, for example, the illustration 

a ction 12.2). Second, the spirit of both conditions of the 
is the same as that underlying the limitation of payments to 
fro ations; in solution theory the defining properties depart somewhat 

m the considerations which led to imputations. 

The discussion of the empirical merits and inadequacies will be left to 

apter 12, where a comparison with experimental data is made. 

‘ ae of the notion fall naturally into six groupings: 

é eh of payments to imputations. When we first discussed n-tuples 
s to players we raised serious objections to the condition of 
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that subgroups could not also be rationa : 1€ a ‘i 
is to limit the subgroups to which this argument 1s pep: 
eral, one suspects these to be coalitions not very an “ 
already existing in the given coalition structure 7. B y 
coalitions will not include the set of all players; yet the r f pay- 
ments to imputations amounts to including the set of all | in W(r). 
This is unacceptable. If, however, one examines the definition of y-sta- 
bility, there is no reason why x cannot be any n-tuple satisfying x; 2 v({7}), 
for every i, and so the restriction to imputations did not actually 
basic flaw. Of course, it does not follow that any of the theorems stated 
for imputations are true in this more general context. By and large, the 
results are not greatly changed, but there is no point in going a the 
details. G 

ii. Introduction of the functions y. A very serious criticism of y-stability 
theory is the very introduction of the peculiar functions y, functions which 
Seen 
enforced, then, even if it is peaible to a a puch are not rigidly 
BP aterill ine’ to one’s esti imate y, there is no assurance that 

mates. 


acceptance shou 
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tions according to 7 We ‘eal when the players are arran ved in coali- 
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rred so often that frequencies can be obser 


recu 1] : " ; 
jssumed, the theory can presumably be const 
are des a oe ‘ 
that assertions of the form “|x, 7] is ¥-stable” will 
€ a ss . 1 . 
sentences such as “x, t| is stable with probabili 
NS) 


development of such a theory appears to rest heavily upon the im] 
of suitable regularity conditions on the probabilities—con ns whicl 


parallel the restriction of the general functions y to the particular cl rf 
functions k. 

If, as may be the case, this whole direction seems too unreal, then it may 
be necessary to return to the foundations of game theory and to recon- 


struct them. ‘The role of these functions is to introduce some sociological 
assumptions into the model. Such assumptions do not occur in its under- 
structure, and evidently they are not necessary in two-person situations 
with strictly opposed interests, but we would suggest that both the diffi- 
culties of n-person theory and an intuitive consideration of conflict of 
interest indicate that sociological assumptions are necessary in general. 
It would be well to have them in the foundations. 

iii. The expectations upon which the equilibrium is based. Basic to the notion 
of y-stability is the idea that the individuals who might potentially form 
the coalition § will compare with v(S), i.e., with their total expected return 
when both § and —S form coalitions, the sum total of their existing pay- 
ments. This comparison might serve for a normative theory if the coali- 
tion structures of all y-stable pairs consisted of but two coalitions— 
actually this is rarely the case—but empirically it seems ridiculous. In 
Society, we can observe the creation of coalitions which could not possibly 
benefit their members if all remaining participants in the conflict of inter- 
“st were to band against them. Of course, such a unified opposition is 
hot expected, and, experientially, this view is justified. Presumably, 
the Participants in a coalition do have some expectation as to the reaction 
and reorganization of the other players to the new coalition, and it is on 
et of these estimates, which may be very long range, that the deci- 

€ached whether to cooperate or not. 

— are really two parts to this criticism. The first is, in large meas- 
a of our earlier comments that in general the characteristic 
a d a inadequate representation of a conflict of interest. None- 
< ou a expectations the players have for the coalition S, it can- 
i. i v(S). Thus, v(S) is a conservative estimate of their pros- 
stable ti ce that, whatever their estimates are, any pair which is 
unction a eir estimates is also y-stable using the characteristic 
a ae fi converse is not true. The set of y-stable pairs based on 
equilibriun, oa function will include anything that is aciually in 
» Plus, In general, some pairs which will not be in equilibrium 
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a coalition, it gets v(S) regardless of what the ss In tha 
case, we would probably want to add the conc o Coalition 
only divide up what it actually gets, 1.€., We would o nsider pairs 
(x, 7] such that for every Pin 7, m= oT). 
iin T 
The second criticism is much more profound. Once one admits that 
the players make predictions about the future in determining the worth of 
a change, one cannot argue that only immediately profitable coalitions 
will form. A coalition which results in an initial loss to its members ma} 
count on a reactionary realignment of the other players which is so 
unstable that, in turn, it will pass to another structure which allows sub- 
stantial returns to the given coalition. Such a possibility seems meaning- 
ful only when there are limitations on coalition changes, but in that case 
ae ee ne We do not see how to cope with such possibilities 
: mework, 
ee on of the players. In contrast to the preced: 
at there may be y-stable pairs which ar¢ 


not, in fact, in equilibri 

) quilibrium, we sh aca 
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ple Jong-range considerations. 3 Wh 

yseful to Cope with such problems is not kn 
y, Existence. Given a game v and a bou! 

not exist any y-stable pairs. This can ev 

conditions are extremely restrictive; for e 


im 


stant-sum game has no 1-stable pairs. As 

restrictions tends to reduce the number of stabl 

no restrictions, the only games with stable pairs are those with ¢ 
empty core. Certain authors, notably von Neumann, have appar ntly 
felt it extremely important that, as for two-person zero-sum games, 
equilibrium states exist for all games, whatever the equilibrium concept 
may be. Von Neumann felt, for example, that the most important prob- 
lem of n-person theory is a proof or counterexample to the theorem that 
every game possesses a solution [Kuhn 1953 a], and the implication is that 
a counter example would be very damaging to solution theory. We cer- 
tainly agree that, mathematically, an existence theorem is very elegant to 
have. Furthermore, if we accept a solution as our idea of social equilib- 
rium, then a proof would establish the existence of social equilibria under 
incredibly weak assumptions. Yet if the theorem does not hold for solu- 
tions— as it does not for y-stability—we should not be unduly disturbed, 
for it is perfectly plausible that there are social arrangements, just as there 
are physical ones, which are incapable of sustaining a state of equilibrium. 
This is not to say that these situations are uninteresting—an explosion 
hardly epitomizes dullness—but only that they are transient. To study 
transience a dynamic theory is essential, and, as we have repeatedly 
emphasized, game theory is not dynamic. 

With the failure of general existence, the mathematical tasks are not 
eliminated nor, in general, is their difficulty reduced. Attempts must be 
made to characterize those games which do and those which do not possess 
€quilibrium states and, for those with equilibrium states, to describe them. 

Vi. Uniqueness and occurrence. Most games which have a y-stable pair 

* more than one, just as most games have more than one solution. 
a .. problem can be bypassed here in exactly the same way as 
ability = ee by mentioning standards of behavior, bargaining 

-. 2 ‘ e like. ‘To us, this is no more acceptable now than it was 
required ot then, can we say? It appears that one point of view is 
oe “ie the theory as normative, and another if we think of it 
ve es escription, however idealized. If it is normative, or if 
Much the sere in that way, the ee of uniqueness seems to be serious in 
tive es ea as it was in Nash s equilibrium theory for non-coopera- 

ion 7.8) and in solution theory. There will, in general, be 
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rium point isad o is Fat 
The situation is reminiscent of certain physical problems in which there are 


several points of equilibria: the one which occurs depends, among other 
things, upon the initial point of the dynamic system, and to predict jt, 
occurrence requires a full dynamic theory, not just an equilibrium theory. 


The analogy seems close enough, and at least for the present, we shall 
assign the failure to predict the occurrence of equilibrium states to a lack 
of a full dynamic theory of coalition formation. 

If this be true, then great care must be taken in assigning meaning to 
these equilibrium states, for, just as with physical systems, it appears that 
there can be equilibrium states which cannot be reached from any other 
aoe ‘ aan for example, a game having a y-stable pair [x, 7] such 

at all the coalitions T ofr are losing. i.e.. i izati (T) =02 
The question is whether such i a — ae 
a ch a pair could possibly have been reached 
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t society isnot. We should not, therefore, | 
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ently stable states evaporating. One mechan 


effected without changing the underlyin g game 
conditions. We may expect to find a dyna 

equilibrium states given by a function ¥ and the func 
action that first produces a modification of the por 

the modified boundary condition, in turn, determines 
hopefully, more desirable equilibria. The boundary 
frst, be relaxed to render the undesirable equilibrium unstable, and 
tightened to produce new equilibria. It is possible to think of 

trust laws as a socially conscious attempt to increase the restrictions against 
the formation of new coalitions, thereby tending to preserve the status quo. 
The conscious manipulation of sociological restrictions to achieve prede- 
termined states of equilibrium in conflict of interests is an important, if 
dangerous, social tool. 


10.3 THE y-STABILITY ANALYSIS OF A MARKET WITH ONE 
SELLER AND TWO BUYERS 


This section continues the example of a market consisting of a seller, 
who possesses a single indivisible commodity which he values at @ units 
of money, and two buyers, who value it at 5 and ¢ units respectively (sec- 
tion 9.4). The characteristic function was established to be: 


v({1}) =a, — o({2}) = o({3}) = 9, 
v({1,2}) = 6, o(f1, 3}) = 4 ~—-a{2, 3}) = 9, 
v({1, 2, 3}) =e. 


The i ° : 
he imputations arrived at by a common sense argument were the core: 


“ 


bine, xo = 0, x3 =C¢ — x1. (1) 


‘a oe imputations were found to Ore in every solution; however, 
- ility of collusion as embodied in the solution concept led to the 
of other imputations. 
ifs Od ee results of y-stability theory for this example using 
Bien ae as in solution theory, namely: there are no limita- 
fies ae ition change. First, it is not difficult to show that any parti- 
a ag either {1, 2} or (2, 3} as a coalition is unstable, so 
y two different 7’s to consider: 


Be Ata at 
it me Pair [x, ({1}, {2}, {3})] is stable if and only if x is in the core, i.e., 
€ts the conditions of eq. 1. 
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slight modification® of eq. 1: 4 x1 < 4% *2 | 

In other words, when there are no eon ° ee: 
ments into coalitions, y-stability and the = a yh 7 , 
substantial agreement, even though the stad) ay hate 
formation explicitly into account. On the other h al umann - 
Morgenstern have accounted for the existence of imp other than 
those in the core—in particular, those in which player 2 gets more than 
Q—as a consequence of coalition formation. Where he. diate 

In practice, at least, it is easy to see why the y-stability result is probably 
wrong. Player 3 is in a position to threaten 2 in such a way that the 


coalition {1, 2} is not really an admissible change—even though tech- 
nically itis. For 3 can point out to 2 that if they (2 and 3) form a coalition, 
then 2 will receive more than 0 (the amount he gets if the threatening and 
bargaining leads into the core); and if 2 tries to use a threat of defection 
from the coalition {2, 3} in order to demand too much from 3, then 3 can 
threaten to force the outcome into the core. Of course, if {2, 3} forms 
ee agrees not to let 1 get more than a, then 1 can come to 2 and offer him 
een 
ee bis a itt € acceptable; however, at this point 3 would go 
Rither 2 has 2. ‘s € more to join 3 in a coalition, leaving 2 with nothing. 
Miecbthe _ eee see this, or 3 can point it out to him, and i 
Process must agree ~ to wd eee ely well, player © Pe 
surely put them into the core os overtures from 1, for this would Just ® 
, with 1 getting more than they can hold him 
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th ose wi th 


These extra cases — 
4, have agreed not to Dreak uD 


to a bargaining problem between 1 an 

that 1 should get less than bh and after that is se 

bargaining problem to divide the spoils. 

would probably be carried out first for all possible spoils, and then the 
former. 


Although it is probably not reasonable to suppose that an arbitration 
scheme would be employed in this context, it is nonetheless amusing 
to see the results one obtains using Nash’s scheme (section 6.5). Consider 
frst the division of the spoils d between players 2 and 3. The status quo 
point for this bargain is (0, ¢ — b). since 2 can certainly get 0, and by 
threatening to form a coalition with 1 player 3 can keep 2 down to 0. 
Similarly, 3 can get at leastc — bina coalition with 1, and by offering to 
buy at the price b player 2 can prevent 3 from getting more than ¢ — 6. 
The Nash solution to this bargain yields a return (@ — ¢ + b)/2 to 
player 2 and (d + c — 6)/2 to player 3. In the bargain between player 
1 and the coalition {2, 3} the amount to be divided is c and the status 
quo point is clearly (a, 0), so according to the Nash scheme player 1 receives 
(c + a)/2 and the coalition (c — a)/2. Substituting d = (¢ — a) /2, we 
find that player 2 gets (2b — a — c)/4 and player 3 gets (3c — a — 26)/4. 

Three points are worth noting: First, for such an arbitration to make 
sense it is necessary that c + a < 26; otherwise player 2 receives less than 
0. Second, the nearer 5 is to c, the more equal are the payments to 2 and 
to 3, as seems reasonable. Third, from the condition ¢ + a < 2b it fol- 
lows that the payment to player 1 lies in the interval between a and 6, and 
80 this resolution of the conflict is in one of the solutions. 

In summary, this example exhibits quite clearly one major fault of the 
V-stability notion, namely, that it does not grant the players any fore- 
—* when there are no restrictions on communication it renders 
© ——_ the coalition structure ({1}, { iz 3}). Having a player react 

iate offers of gain without considering the long-range effects of 


€ reali : : : : : 
sh ignment sometimes yields conclusions at variance with one’s 
ition, 
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og assumption in this and the preceeding two chapters that there 
ingly a transferable utility in which side payments are effected is exceed- 
restrictive—for many purposes it renders n-person theory next to use- 
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: E . isa 1 SGC Beane 
ty atvays case wanes fin 
visibility. To be sure, such transfers will always - : oe s 
but we shall not assume that the utility sum 1s conserved se ‘ ery side 
payment transaction. Of course, once the transferability a a 
is dropped, the characteristic function form of an n-person game no longer 
makes any sense, for a coalition will not have a unique joint utility for its 
commodity outcome—the joint utility will depend upon the distribution 
of goods among the players. The task is to find an appropriate substi- 
tute for the characteristic function. 

Instead of having a number attached to each coalition 5, we shall sup- 
pose that there is a known lottery whose prizes are various bundles o 
goods, services, obligations, etc., which accrue to S as a whole. T° 
simplify the discussion, suppose the lottery is degenerate in the sense that 
there is but one prize, Conceptually, there is no loss of generality in this 


restriction and the general case would be far more cumbersome to discuss: 
Let C(S) denote the com 
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ee denote the set of all feasible physical distributions of C(S) over the 
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This binary relation of dominance on the set of ordered pairs of n- 
allows US to extend the von Neumann and Morgenstern definition of 
~olution to the case of non-transferable utilities. Shapley and Shubik 
(1953), in an abstract in Econometrica, present a similar argument to shov 
that transferability is not essential to n-person theory. They feel, as do 
we, that the conceptual validity of game theoretic applications does not 
depend upon the existence of a transferable utility. 

The modification of y-stability is just as easy: A pair [x, | is p-stable if 
and only if: 


(i) There do not exist Sin ¥(r) and T in 3(S) such that x; < u,(7) for 
all 7 in S, 


and 
Gi) x; 2 u,{C({z})], and the equality holds only if {2} is a coalition of 7. 


As we said, next to nothing is known about these definitions. Pre- 
sumably, their properties depend upon particular assumptions one might 
make about C(S), for it is clear that the mechanics of effecting side pay- 
ments of physical commodities can be very complicated. For example, 
suppose that the joint payoff to the coalition S = {1,2} isC(S) = {A, B, C}, 
where A, B, and C are non-homogeneous non-divisible goods such as a 
house, a painting, andacar. Suppose that, if3 were to join S to form S ‘= 
\1, 2, 3}, C(S’) = {D, E}, where again D and E are non-homogeneous 
and non-divisible goods. The monetary equivalents for the commodities 
4, B, C, D, and E may differ widely from individual to individual, and it 
is not at all clear at this point how a coalition should decide whether it is 
Profitable to add another player nor how it should go about dividing up 
the joint return. This is true even if they agree in advance to the prin- 
ciple of “equal” shares. In Chapter 14 we shall discuss some mecha- 
tists for fair division in groups which conceivably may be relevant to such 


a theory, but no direct attack has been made using that or any other 
approach, 


oe were two major points of contrast between this chapter and the 
i, ingone. First, the outcome of a game in characteristic function 
ries . not taken to be simply an imputation, or even a set of them, but 
diel together with a coalition structure. Second, equilibrium 
inherent Was assumed to arise not only from the strategic possibilities 
of in the game but also from communication limitations, 1.e., a sort 
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lal “friction.” We took this to mean an idealized boundary condi- 
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teristic function 7 and boundary condition y, the wads 
y-stable if: 
(i) 2S) < ) x;, for every Sin ¥(r), 
iin S 
and 


(ii) x; = v({2}) implies {7} is one of the coalitions of r. 


Some properties of the definition were given for a restricted family of 
functions y and several broad classes of games, including the four-person 
constant-sum, simple, quota, and symmetric games. ‘These results seem 
to have the virtues of being both non-obvious and compact. 

A number of criticisms were examined, of which three seem serious. 
First, the function y is not generally explicit in social situations, and no 
theory has been offered as to how it can be determined. Nonetheless, as 
we shall see in Chapter 12, it is possible to make plausible choices for some 
sions Second, Yale pis are ot general nig, © 2 
ae an a * ena aed how to select just one. We pane 
are assumed to have far be = as peapter.. Third, oe ee 

00 limited views of the future. Specifically, @ 


pair [x, 7] i lly, 2 
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chapter {1 


REASONABLE OUTCOMES 
AND VALUE 


11.1 REASONABLE OUTCOMES: THE CLASS B 


Another author who has considered possible limitation on the outcomes 
of a game is Milnor [1952]. He has suggested three different systems of 
“reasonable” conditions, each of which isolates a subset of the set E of 
outcomes, i.e., of the set of n-tuples x such that 


Xi < v(In). 


tinIn 


. doing so, he has taken “ - - - the point of view that it is better to have 
© set too large rather than too small. Thus it is not asserted that all 
ove Within one of our sets are plausible as outcomes; but only that points 
ide these sets are implausible.” [1952, p. 2.] 
3 he status of these three notions, the first of which will be described in 
a” tends to be a bit obscure because very little work has been 
i ean and so only a few of their mathematical properties are 
= a n the literature their only use has been in the analysis of an 
be ¢ nt which we shall describe in section 12.3. We may, therefore, 
ged with devoting too much space—three sections—to their dis- 
i DH 
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cussion; however, WE feel that each one a ae $6 Mies 
that their relationships to other concepts shou : 

Our pattern of presentation will be to devote ae 
Milnor’s concepts, giving his defense of it, some crilicis sau) 
mathematical properties, variations on the concept tus si 
concepts, and its analysis of the market example ws ne Sore 
studied (see section 9,4). Because some of the argum: jedi. 
require consideration of potential changes of coalition alignments, these 

d to seem a little involved; we can only ask the reader to 


sections may ten 
bear with us. 


The first set, B, consists of all those outcomes x of E such that for every player i 
5 « . é ‘ - 3 ty 
x; < b(i), where b(i) is defined to be the largest incremental contribution player i 


makes to any coalition, 1.é., 


bi) = =e [o(S) — v(S — {2})]. 


a in offered by Milnor in support of this condition is extremely 
a aa 4 Ee play of the game, player 7 will wind up in some coalition 5 
> ua ae * {7} would be foolish to keep 7 in their coalition if he tries 
a ., that they could do better without him.’’ [1952, p. 3.] 
—_ f > — seems quite weak and perhaps less reason- 
followin al definition of B itself, for there exist games having the 

& property: There are non-overl 2 as s having 
players 7 and j in § such t: erlapping coalitions S and 7, and 

‘is? é such that 7 is not tempted é ; 
J stays in S, but if 7 move p a to move out of S provided 
S— {7} to ROG Is such a evi P then i can profit by moving from 
Case, if 7 is important to S, it may behoov¢ 
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to pay 7 mor 
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a that he can employ against a particular coalition 

pawevels this may not be correct. Consider, fo 

formed, where S is one of the coalitions in which player 7 mal 

sacremental contribution, i.e., o(5) — v(S — j2}) = b(i P 

attempt to demand more than b(?) on the grounds that ! 

more he will defect from S, that this in turn will cause several other players 
to defect from S, and that in the ensuing chaos and realignment into coali- 


tions the members of § — {2} would probably be worse off than if they 
had paid him. Of course, the other members of S$ might counter with the 
equally plausible argument that 7 too will be worse off, since no other 
coalition will allow him more than 6(7). At this point arguments involv- 
ing interpersonal comparisons of utilities could enter. It is difficult to 
see how to give this formal meaning. However, from a normative posi- 
tion—e.g., were we asked to arbitrate such a game—we feel that no more 
than b(i) should be paid to player 7. Moreover, judging by the data 
reported in section 12.3, we would suspect that “most” people in a char- 
acteristic function game would agree to limit imputations to the set B. 
The following properties of the set B are known: For the three-person 
constant-sum game, B contains the set J of imputations. For four-person 
constant-sum games, B does not contain all of J, but, in at least one of these 
games B does include a sizeable portion of J. In general, it can be shown 
that B includes both the Shapley value (to be discussed in section 11.4) 
and all the von Neumann-Morgenstern solutions.” It is not difficult to 
show that the imputations of the k-stable pairs of any simple game and of 
any four-person constant-sum game are in B, but this is not generally the 
case.* The reason for the incompatibility of these two concepts is that 
Milnor considers all potential coalition structures and allows only one 
Player defections, whereas y-stability considers only restricted changes 
from the coalition structure 7 in the pair. (Incidentally, from the fact 
that the imputation of a y-stable pair need not be in B, coupled with 
inor’s result that every imputation of a von Neumann-Morgenstern 
‘olution must be in B, it follows immediately that the imputations of sta- 
bility theory are not necessarily included in any of the solutions.) 


2a: 
Gillies [1953 6], as well as Milnor, has obtained this result. 
example is the game with characteristic function 


v(S) = { 0, if the number of players in S is not more than 2, 
|S|/n, if the number of players in S, |S|, is more than 2. 


The paj 
casi nt LO © + + , 0, 1), ({1}, {2}, «+ +» {n})] is t-stable since o({i, j}) = 0. Itis 
Y seen that for this game (i) = 3/n, so for n 2 4, 


xn = 1 > 3/n = b(). 


hus : 
> the imputation (0,0, - - - , 0, 1), which is in a 1-stable pair, is not in B. 
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11.2 REASONABLE OUTCOMES: THE CLASS i, 


Milnor’s second class of reasonable outcomes is, in some ways, concep. 
. VAT 9 | e the f, 7 

tually allied to y-stability, and so tothe core. If we make the following 
suppositions about coalition formation, the condition arises naturally: 


i. The bargaining in a game leads to the formation of a coalition which 
is opposed by its complement. 

ii. There are no limitations on preplay communication, so any coalition 
deemed profitable may form. 

iii. In order for a coalition S to form, it must distribute its total payoff 
in such a manner that each subset S’ of S is given at least v(S’). 


If x is any imputation and Sa coalition, let x(.S) denote the sum of pay- 


ments to the players in S, ie., x(S) = x; Suppose that x is an 


. ° e . . : ep . 

ee which arises and is in equilibrium when the players are 

oe according to the coalition structure 7 = (T, —T), and suppose 
ili implies 

ae is satisfied. Then, for any set of players S, condition (iii) implies 


x(S’) 2 v(S’), 


wh bts 
ere S" = S(\T (the part Common to § and T) 


and 
om x(S — §’) > v(S — §’) 
Ing the i iti | 

8 these two Inequalities, we Set as a necessary condition 


aS) 0S ~ s°), 


(ii) implies a upon the particular coalition struct!” 

suppress a at any such structure may form as needee 
right-hand €xpression MS . It is reasonable to take the minimu™ ; 

all possible coalit; : unts 
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Min 

Ss’ [v(.S’) + v(S a Signs 
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e €x ‘ ae a abst ah 
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sa 


nsists Of all those outcomes x of E such that, for 
60 


x(S) 2 U(s) = min 
sr 


S’ a subset of S 


& ~ M4 . > > } ‘ 
pression on the right is denoted by 


» Given the above motivation for the class LZ, the following more refined bound 
may be suggested. If both the coalition structure (JT, —7) and the outcome x 


result, then, by the argument used for y-stability, it is unte nable that either 
a(T).< o(T) or x(-T) < o0(-T). 


Thus, we should restrict our attention to those cases where the opposite inequali- 
ties hold, i.e., for each S, x should satisfy 


x(S) 2 min {o[S1\T] + ofS1\(—T)]}, 
where the minimum runs over all those coalitions JT such that 
x(T) 2 vo(T) and x(—T) 2 vo(-T). a 


The following results concerning Z are known. For the three- and 
four-person constant-sum games, L is exactly the intersection of the set B 
with the set J of imputations. This cannot be generally true, however, for 
we know that the intersection of B and J includes the von Neumann- 
Morgenstern solutions and the Shapley value, and an example can be 
given of a game with a solution not wholly in L and with its value not in L. 
It is not known if L is always non-empty, but at least for many classes of 
games it is not the empty set. Indeed, for many games the condition is 


far too weak to be of much interest (see the market example below and the 
&xperiment in section 12.3). 


>For the market of two buyers and one seller, a simple computation shows that 
KIN) =a, 1({2}) = 1(43}) = 0, (11, 2)) = 4 (1, 3)) =, 
1({2,3})=0,. and | I({1, 2,3}) =a. 


utcomes L consists of all x such that x1 2 a, x2 2 0, and x3 2 0, 
Or th example, that all imputations are included in L. In this case, 
» the condition imposes no real limitations. | 


Thus, the set of o 
Which means, for 


a are Possible variants on the class L which stem from the same or 
ee, uP tions. Let us drop the first assumption that a coalition is 
the ae opposed by its complement, but let us retain (ii) and (iii). If 
mks the a a forms we may suppose that its members will decide how to 
Posie, a. accruing to it, provided it remains intact. Such decisions, 
ae a, : not generally be disclosed to the other players in the game. 
Tey, ac of 2g T enters into negotiation with a subset S” of T to form 
1on, it behooves the members of S’ to mislead the others as to 
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itl i >15 > eS th- s 
a subset of a coalition in a coalition structure 7 1S to rece less than it can 


command in the most adverse situation, it will know it a a hie demand 
ce fore its “parent” coalition or defect. Thus, it is reasonable to 
require as a necessary condition for a pair [x, 7], wher 4 X 1S an outcome, ” 
be in equilibrium that x(S) 2 v(S), for every s which is a subset of a coali- 
tion Tint. This is simply a case of w-stability theory where y(7) con- 
sists of all subsets of the coalitions in r. Note that when we introduced the 
idea of the boundary condition y we described it in terms of lack of com- 
munication, whereas here we have assumed completely free (but not 
quite honest) communication. 

We may go on to add a second condition which is in a somewhat differ- 
ent spirit. Suppose 7; and 7; are two coalitions in the structure 7, and 
suppose o(T;UT;) > o(T;) + v(T;). It should then be clear to the 
members of 7;\/7; that they can come to a mutually profitable arrange- 
ment, and so the coalition structure will be disrupted. Thus, we conclude 
that [x, 7] will be in equilibrium only if: 

(i) 


and 


For every subset § of a coalition in t x(S) 2 v(S), 


(ii) +e ion subcollection of coalitions iemecey 2, lin" *? Tio 
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x wit 


(i) Itis feasible for —S in the sense that it cz 


and 


(ii) x(S) < 6 and, so long as § demands as muc! . 
rupt the coalition —S by causing defections to occur. 


The outcome x is feasible for —S§ if 
@’) x(—S) < o(—S). 


In a similar way, let us try to give formal meaning to the secor 
| erty. Suppose that a subset T of —S were to defect to S, then, if S§ 
remains an amount v(S\/T) —6 to be distributed among the players in 
T. To make the defection attractive to T, it is necessary that the amount 
that they can receive as a result of the defection exceed the amount they 
were assured of by the outcome x. Thus, S will be unable to disrupt —S, 
and demand 6 for itself at the same time, if 


fi’) x(T) > o(SU T) — 6, for every subset T of —S. 
By setting T = —§ in (ii’) and noting that o(J,) > x(I,) we obtain 
6 > vZ,) — x(—S) 2 x(I,) — x(—S) = x(S). 


So the following formal definition is set up: a payment 6 for a coalition § 
pen unreasonable demand if there is an outcome x such that conditions 
(’) and (ii’) are both met. We observe that a necessary (but not suffi- 
vag Condition for 6 to be unreasonable is that 6 > v(S); this follows from 
(i’) when T is the empty set. 

Conversely, 6 is called reasonable if it is not unreasonable, i.e., if and only 
_ X is feasible for —S [x(—S) < v(—S)], S can lure a subset T to 
- from —S [by giving JT more than x(7)] and still have at least 5 left for 

Mike, (SUT) — x(T) > 3]. 

© set D is defined to consist of all outcomes such that the sum of pay- 
'S to each subset of players is reasonable. 

PA more formal d 


18 to get 
(SUT) = (T). : 
Maximum return: 


Men 


efinition of D can be given. Consider any outcome x, then, 
et T to defect from —S, it cannot expect to get any more than 
Presumably, S would try to attract that T which allows it the 


max fu(SUT) — x(T)]. 
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i max [o(S\ 
§ > dS) = min { . 
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So the set D of reasonable outcomes consists of those outco 

subset S, x(S) < a(S). In other words, d(S) is the most a 
i . . ms bas wa it 

considers joining forces with var1ous subsets of —S, assum ade te 


to hold S down. < 

Relatively little is known about the set D, but an example can be given 
(see the market example below) where neither the Shapley value nor a von 
Neumann-Morgenstern solution are included in D. On the other hand. 


for the three-person constant-sum game, the intersection of D and of E 
(see section 9.7) is very closely related to the symmetric solution F. It is 
the set of payoffs spanned by the three imputations (14, !%, 0), (14, 0, 14), 
and (0, 14, 14) of F—i.e., the set of imputations of the form 


Pt)iee(g0,)+-:(0,5,5)- (BF 
* >»? 3 0 50 mm Altes — a - i. Me = ae) 
Cy BANG») *8\% 9D ( 2 2 2 


where x1, x2, x3 are non-negative and sum to 1. 


nas d({1}) = c, d({2}) = 0, d({3}) =¢ — 4, d({1, 21) =4 
B02 D eee . a —a. Thus, for x to be in D it is necessary that 
Rete tions fiom the been the von Neumann-Morgenstern solutions nor all the 
we restrict our attention to € pairs. It does, however, include the core, sem 
core, imputations in D, that subset of D is identical to the 
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fom Chapter 6, anything can happen 1 
particular it might get Veg and }2, 3} mig 


must bargain for the quantity 1546. F 

iG and if 2 objects and combines with 

reduce 2’s return to zero. Of course, player 2 cai 

what will happen depends upon psychological variables not iricluded in 
the model. This is not our problem here; all we wish to do ara 
out that the imputation y is not a ludicrous starting point. In summary, 


one may say that the stability of the coalition {2, 3} with the imputation 
y arises from the fact that the demands of both the players are unreason- 
able, and any attempt by one to hurt the other will in fact hurt both. 
The following summary comments on Milnor’s three subsets of outcomes 
seem to be justified. As we shall see in section 12.3, in one experiment 
the outcomes did for the most part lie in the sets B and L but not in D. 
Furthermore, conceptually it seems reasonable that they should lie in both 
Band L, for the conditions defining B and L are quite weak and they do 
not attempt to take into account the interlocking threat relations among 
the several coalitions. ‘The set D is more difficult to comment on, for its 
rationalization is somewhat complicated. We like the spirit of the idea 
in that it brings into play the threat power possessed by subsets which will 
defect from a coalition if there is an assured profit to that action, but it 
does not follow the argument through to see what reactions and counter- 
reactions will probably ensue. The above example suggests that it fails 
to capture certain salient aspects of the threat situation. In addition, 
when there are a large number of players the following doubts about D 
asa descriptive idea seem relevant: its defense is based upon the supposi- 
Hon that a coalition is always opposed by its complementary coalition; 
the condition is required to hold for all subsets which, we have previously 
argued, may not be reasonable for a descriptive theory or for certain types 
of normative theories; and there is absolutely no indication of the coalition 
structure which one should find associated with an outcome lying in D. 


11.4 vatur 


, In contrast to the two preceding chapters and the first sections of this 

—.; we shall no longer be concerned with possible equilibrium out- 

entire or games, but rather with the notion of ana priort evaluation of the 
Same by each of its players. Shapley writes, 


cxpettempting to apply the theory [of games] to any field, one would normally 
havin q permitted to include, in the class of ““prospects,” the prospect of 
Giticn play a game. The possibility of evaluating games is, therefore, of 

Mportance. So long as the theory is unable to assign values to the games 
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find his a priori expectation it is necessary to know the probabilities of 
occurrence of each of these various equilibrium states, and that pre- 
sumably requires a dynamic theory. This, then, is a blind alley, and some 
other approach must be found. Not all is wasted, however, for viewing 
the problem of a priori evaluations in this way makes at least one thing 
clear. There is no reason to expect the evaluation to be one of the 
equilibrium outcomes. Suppose the imputation x occurs with proba- 
bility p;, then the a priori expected return to player 2 is 


_— et ) 
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" ‘ult, to give a formula for the function(s), ora systematic procedure 
ee eby it can be determined for specific cases. Such was the approach 

= by Shapley [1953 4]; he listed three apparently weak conditions, and 

= De incly, he was able to show that these uniquely determine an 
| evaluation function—that there can be only one function satisfying the 
~~ Giree conditions, and that there is one. He has called the function so 
determined for each player the value of the game for that player. 
__Westart out first with the idea that a player’s evaluation of a game isa 
‘number, so we may symbolize it as ¢;(v), where 7 denotes the player 
v the characteristic function of the game. Since the numbering of 
players is arbitrary, we may always renumber them in any way we like 
permutation of the original system. This will cause the characteris- 
inction to look different even though it represents the same underlying 
but, since these are only notational differences, players who corre- 
under the relabeling should have the same value. So Shapley’s 
condition is: 


Value shall be a property of the abstract game, i.e., if the players 
ermuted, then the value to player : in the original game shall be the 
the value to the permutation of player 7 in the permuted game. 
a bit differently, the value to a player should not depend upon the 
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where s is the number of elements in S ana Yn(5) ifn! 
The symbol k! stands for k(k — 1) q 3° I, v hen 3 a po it ve 
integer; and 0! = 1. Let us examine this Jorm ila in more detaul. [t is 
a summation over all subsets of the set of players, with a typical term 
consisting of a coefficient—which we shall discuss presently—multiplying 
[o(S) - oS — {i})]. Ifzis nota member of S, then S — {i} = S, so the 
term becomes Zero, thus the formula depends only upon those coalitions 
involving 7. It amounts, therefore, to a weighted sum of the incremental 
additions made by 7 to all the coalitions of which he is a member. It 
may be useful to carry out the calculation in a few simple cases. First, 
consider the general two-person game: 


gy = UO cts, 28) — oCl2b] + Gp CEA) — 066) 


= Wlo({1, 2}) + o({1}) — o({2})]. 


If the two-person game is zero-sum, L.€., v({1, 2}) = 0 and o({2}) = 
~2({1}), then ¢; = v({1}), which establishes that the Shapley value is in 
fact a generalization of the minimax value. 

For the general three-person game: 


2!0! ! 
| $= (ft, 2, 3}) — v({2, 3})] + = [o({1, 2}) — v({2})] 


012! 
ay RUE) = hark 


111! 
| + [o(E1, 3), = 0((3))1 


‘ we substitute into this formula the values of the characteristic func- 
| 1on of the three-person constant-sum game in 0, 1 normalization we find 


= il ‘ ° A : 
| Y%, which, considering the perfect symmetry of that game, is the 
| €sired answer. 


> 
a Shapley value for the market situation previously discussed in sections 
10.3, 11.1, 11.2, and 11.3 is: 


1 = a/3 + b/6 + ¢/2 


do = —a/6 + b/6 
ep d3 = —a/6 — b/3 + ¢/2. 
€ : i i 
well e to show that, ifa <b <c, then ¢1 > ¢3 > $2. This ordering conforms 
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11.5 VALUE AS AN ARBITRATION SCHEME 


In addition to the questions we have raised about the axioms for an 
a priori value, one can also question why we should ever be concerned with 
such an evaluation of the entire game by each of its players. What opera- 
tional use will be made of it? So long as this is uncertain, it is difficult to 
criticize the axioms in a fully convincing manner. What we propose " 
do in this section is to consider an alternative interpretation of the axiom 
and to criticize them from that point of view. We shall look on the vale 
in the same spirit as in our previous discussion of arbitration schemes for 
— games—as an arbitration scheme for n-person games in char- 
acteristic function form. In fairness to Shapley, we must point out he 


has never given this interpretation, and so the fact that the value = 
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| 11.5] ee 
strategic role of each player and not by his |: 

- demands that he restrict his attention to Pa 
ibe players should not be able to point out to him 
would all prefer. The third condition is hard r to rati 
effect, that, if a game can be decomposed into two games, the value 
assigned to a player shall be the sum of the values assigned him in each ot 
the component games treated in isolation. 

Although we took exception to the second condition when value 
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is 
interpreted as a reasonable a priori expectation, there is certainly no objec- 
tion to it as an arbitration condition. ‘The first seems equally acceptable. 
It is only the third that seems to give trouble—serious trouble. This we 
may best illustrate by examples. Suppose v and w are two characteristic 
functions in 0, 1 normalization on three players, where: 


o({1, 2}) = o({1, 3}) = 0, v({2, 3}) = 1 
w({1,2}) = 1, w({1, 3}) = w({2, 3}) 


If u = v + w, then the characteristic values of the one-person coalitions 
are still zero, and 


w({i, 2}) = u({2, 3}) = 1, 
u({1, 3}) = 0, 
u({1, 2, 3}) = 2. 


For the games v and w, the imputations (0, 14, 14) and (19, 14, 0) respec- 
tively seem to be reasonable arbitrated outcomes, and they are in fact the 
Shapley values. Thus, the Shapley value for the game u is (14, 1, 19). 
Is this a reasonable arbitrated outcome for u? ‘To us it seems ques- 
tionable, Certainly, if either the game v or w is treated in isolation, then 
the coalition {1, 3} makes no sense at all. But in the game uw, or what is 
the same thing if v and w are being jointly considered, it does make good 
sense. It remains true that, as a coalition, they command nothing, but 
they also can hold 2 to nothing. So a bargain exists between {1, 3} and 


‘ oe two units for them to share. There are two possible outcomes, 
a : : ‘ 
©A of which seems plausible. Either: 


oe Coalition f133}rand |{2} receives an equal share of the total 
2 oe gain, 2 units, and the outcome is the Shapley value, or 

ake. ach Player receives an equal share of the incremental gain of 2 
. 8!ving rise to the arbitrated outcome (24, 24, 24), which is different 
| the Shapley value. 


ken accepting case (ii) as the fair arbitration of this game, the 
a. value is discredited for that purpose; thus, we need only give our 
‘On to those who find the reasoning leading to case (i) agreeable. 
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The Shapley value is, of course, (9%, 26, 13). In the game w’ one would 
expect the coalition {1, 2} to form and to bargain with player 3 about his 


potential contribution of 1 unit to {1, 2}. Again, two possible divisions 
seem plausible: 
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1, 
u’({2, 3}) i 0, 
= 2. 
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i. {1, 2} and {3} each get one-half of the marginal unit, which gives 
rise to the outcome (34, 34, 14), which is not the Shapley value, or 

ii. Each player gets an equal share of the marginal unit, giving rise to 
(%, 5%, 14), the Shapley value. 


Thus, whichever of these plausible arguments you prefer for arbitration, 
there is an example where it fails to yield the Shapley value and the alter- 
native argument does arrive at the value. The basic trouble, as we s¢¢ it, 
boas ens Pley’s third condition is that it is unreasonable to demand that 
Players involved in two games play each in isolation of the other. Non 
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APPLICATIONS OF 
1-PERSON THEORY’ 


12.1 THE A PRIORI POWER DISTRIBUTIONS OF VOTING SCHEMES 


Possibly the most interesting published application of n-person game 
theory (as of late 1956) to a social science problem is an attempt to esti- 
mate the a priori power distributions inherent in various legislative voting 
procedures. In a very readable article, Shapley and Shubik [1954] have 
suggested that the notion of value discussed in the preceding chapter is 
Suited to this purpose. Indeed, if one accepts the three conditions Shap- 
ley stated as necessary for an a priori evaluation of a game, it is the only 
function which is suitable. 

Consider the passage of bills at the federal level in the United States. 

nl most cases, there are only two ways a bill can be passed: either by a 
simple majority in each house of Congress plus the president’s signature, 
or by a two-thirds’ majority in each house overriding the presidential veto. 

l other combinations fail to pass a bill. Let us treat the president, the 
“nators, and the representatives as the players in a simple game, i.e., a 
Same whose characteristic function assumes only the values 0 or 1. A 

“oalition” is defined to be winning and to have the value 1 if it can pass 

‘We h 


theo nave not attempted to summarize the complex of applications to economic 
"y discussed by Shubik in Competition, Oligopoly and the Theory of Games [1957]. 
253 
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le to compute the Shapley value fot é esis 
the formula given in secuo! The tate 


pretation of that formula reduces to the following: t] e ility that 2 
given individual will be pivotal in transforming a losir gc ilition into a 
winning one when the final winning coalition is built up random selec. 
tions from among the players. It is true, of course, that in an existing 


legislative body the formation of coalitions is not random; however, when 
considering a voting scheme in advance one cannot know what deviations 
from randomness will occur, so it may be argued that the value gives a 
suitable a priori estimation of relative power positions. 

Whether we are willing to accept the value as such an estimate depends 
largely upon our willingness to accept Shapley’s third, and controversial, 
condition. For this situation, it says that, if a player participates in two 
such legislative schemes, his a priori evaluation of the two together 1s 
simply the sum of the values he would assign to the two schemes inde- 
pendently. Thus, for example, were we to consider not only Congress as 
teres ad ao committees as a separate oy 
game consisting of Con Rie, “ae — bie ee es: < 
the values of the two an wc anne idl 
Bytes thet insofar as ais nt games. In effect, then, the condition 

priort evaluation is concerned, we must sup- 


pose that there j i i 
ek S NO power interaction between Congress and its com 
; this strikes us as unrealistic ; 
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42.2] 
‘ution unsuspected and unintended by the authors of the revision.” 
istrl - ode , 
a p 787.| Without committing ours lves as to the naivete or inten- 
ae ie 5a : 1 that ro ge] ; 
U s of its quthors, it is amusing to compute the a priort power distribution 
t10n 


Ta Iations Security Council and t speculate on the propa- 
of the United Natio Sater 


wanda value these figures might once have had. It will be 

cae Council consists of eleven members of whom five—the “big 

have vetoes. To pass a substantive resolution there must be no vetoes and 
seven afirmative votes. Shapley and Shubik report that 98.7% of the 
power lies in the hands of the “Big Five” and only a total of 1.3% resides 
with the other six members. “Individually, the members of the ‘Big 
Five’ enjoy a better than 90 to 1 advantage over the others.” [1954, 


791.) 
: There is little point in reporting the calculations of other special cases; 
‘the reader is interested in the a priori power distribution of a particular 
legislative scheme he will find that the computation is straightforward, 
though it may be laborious. 
It is important to emphasize the nature of the measure used in these cal- 
culations; Shapley and Shubik make the point forcefully: 


In a multicameral system such as °° ° [Congress], it is obviously easier to 
defeat a measure than to pass it. A coalition of senators, sufficiently numerous, 
can block passage of any bill. But they cannot push through a bill of their own 
without help from the other chamber. This suggests that our analysis so far has 
been incomplete—that we need an index of ‘blocking power” to supplement the 
index already defined. To this end, we could set up a formal scheme similar to 
the previous one, namely: arrange the individuals in all possible orders and 
imagine them casting negative votes. In each arrangement determine the person 
whose vote finally defeats the measure and give him credit for the block. Then 
the “blocking power” index for each person would be the relative number of times 
he was the “blocker.” 

Now it is a remarkable fact that the new index is exactly equal to the index of 
Cur original definition. We can even make a stronger assertion: any scheme for 
oe power among the members of a committee system either yields the power index defined 
above or leads to a logical inconsistency. [1954, p. 789.| 


The precise formulation of the last statement, which we would regard as 
stand as it stands, is the theorem stated in the preceding section. 
“s a and Shubik’s assertion is true if we accept the three conditions as 

ry to a “scheme for imputing power,” and not otherwise. 
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2 POWER DISTRIBUTIONS IN AN IDEALIZED LEGISLATURE 


= 2% tori evaluation of power distributions based upon a random selec- 
o. Of participants to form a winning coalition may be suitable for dis- 
“a ‘sing proposed legislative schemes, but it will hardly satisfy a political 
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ple should not be ta en pean h ed for that—hure.s, 
sional power distributions—it 1s much too simpiihi ae : Dut rather 
as an effort to demonstrate that some of the techniques of game theory 
may be suited to such a study. id 

As in the Shapley-Shubik model, Luce and Rogow [1956] suppose that 
the legislative scheme is described by the characteristic function of a simple 
game. The result of passing a bill is certain “‘power” rewards which are 
distributed among the members of Congress; in our previous language, 
such a power distribution is taken to be an imputation. One idealization 
occurs here: it is supposed that power is a divisible and transferable com- 
modity, and in some measure it is but certainly not in the neat ways of 
game theory. The general problem is to find which power distributions 
coupled with which coalition structures are stable. A simpler problem, 


the one Luce and Rogow attacked 
structure such as the two- 
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Ms acP a aR A CeC a 
Considering each of these special cases alo : 
tions one can make about the two-party coalit st jori 
party has simple majority only or a two-thirds majority, and the pres! 
att) ° : G45 i, 
js or is not a member of the majority party- there result a total of 36 dif- 
N ) \ 


ferent Cases. Each case is examined separately, using the definition of 
y-stability, to determine which imputations with the two-party coalition 
structure are stable. Actually, Luce and Rogow did not use the defini- 
tion as it is given in section 10.1; rather, they waived the second condition 
which states that a person is not a member of a non-trivial coalition unless 
such membership is profitable. Their rationale for this change was that 
passing a bill is not a one-shot affair but part of an ongoing process, and 
so coalition membership might well be sustained, at least for a period, 
even if it does not produce immediate rewards. This difficulty, and its 
arbitrary resolution, raises the question whether it is often feasible to 
isolate a game from its more general social context and to suppose that it 
can be played without regard to these outside factors. In principle there 
is no problem, for either the game can be enlarged to include these fac- 
tors or the utility functions of the players can be chosen to take them into 
account. In practice, however, neither alternative is particularly useful, 
and so certain ad hoc tricks have to be employed. 

Note well how oversimplified this model is. It fails to take into account 
many facts of known importance: the interaction among bills, the dis- 
parate returns of power and prestige depending upon which coalition 
passes the bill, the whole role of the important congressional committees, 
the possibility of filibuster in the Senate, etc. On the other hand, such 
limitation on party defection as pressure from constituents, party disci- 
Pline, pressure from lobbies, etc., are built into the description. To be 
sure, it would be very difficult to specify exactly what function p holds 
ee a particular vote, but this does not prevent us from examin- 

Possible functions of the given general type to determine what gen- 

ral conclusions hold. 
i i eee of the stable distributions for a particular function y 
example a and the reader is referred to the Luce-Rogow Paper for an 
litde fe : is worth noting that for the given assumptions it amounts to 
iscussing E = a formalization of the ordinary arguments one uses when 
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reality, they are in a form which lit : 
they can be evaluated by him in the light of current theory and data 


They are: 


1. In all cases the arrangement of Congress into two opposed party coalitions is 


stable provided the power is distributed as indicated. In very many cases, how. 
ever, it is necessary to form coalitions other than along party lines in order to 
produce a winning coalition, i.e., to pass a bill. In only one case are the limita- 
tions so stringent that no working majority can form: this is when the president is 
of the minority party and will not defect to the majority, the majority has only 
a simple majority even with the defectors from the minority, and the minority 
does not have a simple majority even with the defectors from the majority. 
What is interesting is that in only one case of the 36 can such an impasse result 
re . | ae the president is weak when the majority party 

of it or not—has a two-thirds majority. If this model 


has i i 
any relation to reality, we must conclude that a president should fear a real 
congressional landslide for either party. 
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12.3 AN EXPERIMENT 


Notably lacking in all of our discussion have been data, or even the 
mention of data. In part this may be attributed to the realization that 
game theory is inadequate as a descriptive theory; human beings simply 
do not have the perception, the memory, or the logical facility assumed 
by any of the theories. But two other reasons are actually more impor- 
tant. Assuming that we wish to carry out empirical studies of a coalition 
theory, it is necessary to know the characteristic function, and this would 
seem to entail knowing the normal form. We have already pointed out 
the great difficulty of determining the normal form of most existing game 
situations and, even assuming that known, the extensive calculations 
required to solve the two-person games on which the characteristic func- 
tion is based. Yet without the characteristic function, we cannot know 
what any of the theories predict. A second difficulty exists even if the 

characteristic function is assumed to be known: what does the principal 

theory—von Neumann and Morgenstern’s solutions—predict? In dis- 

cussing the outcome of an experiment performed at RAND, Kalisch, 

Milnor, Nash, and Nering remark, 


It is extremely difficult to tell whether or not the observed results corroborate 
the von Neumann-Morgenstern theory. This is partly so because it is not quite 
a what the theory asserts. According to one interpretation a “solution” 
ents a stable social structure of the players. In order to test this theory 

€quately, it would probably be necessary to keep repeating a game, with a 
in — of players, until there seemed to be some stability in the set of outcomes 
a ee ered. One could then see to what extent the outcomes of this final 

ie Inmate each other and to what extent other possible imputations are not 

nated by them. [1954, p. 313.] 


: Bevo difficulties, the second seems less important, for even if it 
.. Possible to interpret solution theory there do exist other theories 
- are geoly given empirical meaning. ‘The problem of determining 
hn pee Seistic function is more profound, and it seems to us that the 
me _ eae development for empirical verification will be a practical 
Probab| - calculate the characteristic function of actual situations, 
atea ig 4 ; € most significant contribution social scientists can make in this 
teristic = paele method for the approximate determination of charac- 
Ctions. In section 12.4 we shall describe one such proposal, 
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malization, and to others in an S-equivalent tor sUDJECcts were 
told what each coalition would receive if it formed, and they were given 
10 minutes to form coalitions and to agree upon payments, the agreements 
being reported to an umpire. He announced the agreements to the 
group, and if there was no dissension he held the players rigidly to their 
formal agreements at the end of the bargaining. ‘The authors point out 
that there were, in addition, numerous informal agreements which were 
not processed through the umpire but which were kept in good faith. 
We feel that the general qualitative impressions of the authors are of 
sufficient importance to be quoted at length: 


There was a tendency for members of a coalition to split evenly, particularly 
pe the first members ofacoalition. Once a nucleus of a coalition had formed, 
ae ae a re to exacta larger share from subsequent members ofa 
Bpieared to be auc ency lor an even split among the first members of a coalition 

» in part, to a feeling that it was more urgent to get a coalition 


fe 

ae ae : ae much about the exact terms. 
eature of th ini — 

ee € bargaining was a tendency to look upon the coalitions 


ve ing 

the fact that some re Se only Ones worth considering, often overlooking 

mutual benefit - - - yers could gain in a coalition with a negative value to thel 
Coalitions ‘ 

of mo : 
from smaller in persons seldom formed except by being built uP 
bargaining between ive hg Coalition forming was usually also a matt! 0 
0 
A result of these tend Bae tt than more. 


two- ned encies w i . ras the 
~ Person coalition with as that the coalition most likely to form was t 
always re did 10! 


; jnto 4 
tage was most likely to get , 
alitio” 

tins 


Nt tog symmetr; € lately, ter the umpire said “go,” and to ©? wiv" 


ric ven in . i . 
Same, the pla a game which was strategically reas” 


42.3] 
might ne 


hat some players felt they Wel e better off tl 
got into coalitions, while others felt th 
they 8 t into coalitions. ‘They seemed to pay 

7 en of the coalition was the same to all 
+. differences between the players we \ 
tendency of a player to get 0 uae nage - ‘© ay 
ralkativeness: Frequently, w en a oe ition formed, 
took charge of future bargaining for the coalition. In man} 
layed a role even in the first formation of a coalition; 
joudest after the umpire said “go ’ made a difference in t 

In the four-person games, it seemed that the geometrical arrangement o1 U 
layers around the table had no effect on the result; but in the five-person game, 
and especially in the seven-person game, it became quite important. T hus in the 
five-person game, two players facing each other across the table were quite likely 
to form a coalition; and in the seven-person game, all coalitions were between 
adjacent players or groups of players. In general as the number of players 
increased, the atmosphere became more confused, more hectic, more competitive, 
and less pleasant to the subjects. The plays of the seven-person game were simply 
explosions of coalition formations. 

Despite the exhortation contained in the general instructions to instill a com- 
pletely selfish and competitive attitude in the players, they frequently took a fairly 
cooperative attitude. Of course, this was quite functional in that it heightened 
their chances of getting into coalitions. Informal agreements were always 
honored. Thus it was frequently understood that two players would stick 
together even though no explicit commitment was made. The two-person 
commitments which were made were nearly always agreements to form a coalition 
with a specified split of the profits, unless a third player could be attracted, in which 
case the payoff was not specified. This left open the possibility of argument after 
a third party was attracted, but such argument never developed. -In fact, 


a: ne principle was always applied in such cases. [1954, pp. 


We have quoted at such length for three reasons. First, it is important 
when evaluating the results that the reader have some flavor of the proce- 
dure and of the performance. Second, it is interesting that the coalition 
coe were effected, in the early stages, one person at a time and, in the 
a ages, by one small coalition joining with another. Third, certain 

P 2 of the experimental procedure seem undesirable and could easily 
‘ fetes The geometrical effects, though possibly interesting in 
; oe ications, are not desirable in a study of human response to 
ied ee functions. To eliminate these one might employ telephone 
group ee or a variant on the Bavelas partitioned table for small 
se: (Christie, 1956]. The latter wou d require the use of 
the ot isi which incidentally would give a permanent record of 
Municatio, = It would have the slowing effect that any written com- 
this on i but it is not clear that this would be any disadvantage in 

* Further, in the small group work it was observed that a high 


plications of n-Person Theory “ 


262 Ap D Ex eriment for the F ? son Const 
12.1, Results From RAND “xpe” v({1, 3}) = /  ~Onatant 
an! Sum Game: v({1, 2}) = 74 a an, 
Imputation 
oe ition 
Players ture 
1 2 3 a us 
ee —— 
a 00  .40 .30 .30 1}, {2, 3, 4] 
Game 1 : “00 43 43 o>. | 1}, {2, 3, 4) 
3 1B .38 .38 Po |} (U1, 4), {2, 3] 
4 fee 00 S| 1, 2, 3}, 4} 
Reso 13013 | 11,21, 3,4 
6 Seen oo 15 | {1, 2, 44, 133 
7 oe 448 oe | {1, 2, 3}, {4} 
8 44. 44 .00 ya (1, 2, 4}, {3} 
Average .20 43 225 et Sein 
4 1 og 25. .38 , 3, 4}, 
Bee 2 .00 .42 -42 el? {1}, {2, 3, 4} 
3 moe 00.  .46 525 it, 3, 4}, (2) 
ni Runs 4 8. ..54 .00 .08 {1, 2, 4}, (3) 
i Pee sa 00 = 10 (1, 2,4}, (3 
| 6 ess. 38. «13 (1, 4}, 12,5] 
vi 7 Peees4 .00 .08 (1, 2, 4}, 131 
vi 8 a ds f1, 3,41, 121. 
| i Average 286» ..30 24 .18 
| Value 25 :33 25 TF 
Vi) Quota esd = 25 00 
| : yee data have been adapted from Kalisch et al. [1954] by transforming a 
ae he 2 rmalization. Game 1 was presented to the subjects in what amoun 
normalized form; game 4 was in an S-equivalent form. 
d . 
cir; nthe ty Was Preserved, and this might allow more rl 
eas. Het an was obtained at RAND. One may also question “* 
urgency Bik, cin as pase poi it p pobably Sate i eo 
It is doubtful that Ber omit the Players to reflect about their dec!s 


in 10 minutes, 


Each of t it 
subjects oe ue Perton Sames was played eight times, a total of eigh 
for each play of a Cyed. Changes in the grouping of players were ma 
: € game to Prevent the formation of permanent coalitio s 
“sum, the characteristic functions ” ”’,. 


’ 
: oo 

ame j by their Values for three two-perso” 

© Is described by: 


ee and -1(/1, 4}) = 4 


12.5] : | 
LE 12.2. Results from RAND Experiment for the Symmetric Four- 
TAB Person Constant-Sum Game: v({1, 2}) = v(il, 33) (11. 4}) 
Imputation 
Players | Structure 
1 2 3 | | 
Game 2 1 45 “15 38 05 | L Ay, £29.33 | 
2 48 .20 ao c(i es 9 AP P85 8 
3 19 Ay onl gies wr SA IZ SH 
Runs 4 25 31 44 00 bai D Shi [4g 
5 21 19 31 29 fie Heal 2 3t 
6 28 9 31 BPE, At (20g 
7 00 .40 51 .09 {1}, {2, 3, 4} 
8 00 30 43 28 a eel ae Pi: 
Average x) :20 38 .16 
Game 3 1 Bs) 25 25 me) (4, 2}, {3, 43 
2 .00 26 36 38 {1}, {2, 3, 4} 
3 34 33 34 .00 f1:2) Shs (44 
Runs 4 38 36 .00 26 bas De Abel Bt 
5 25 25 25 25 Ne VAD 
6 .00 36 28 36 A OR 
7 25 20 25 ZS Gar as 
8 225 AS) A) 25 (1.253, 4) 
Average al 29 Be) 325) 
Value ano 485) #42) 15) 
Quota 25 225 25 125 


ES San 
_ These data have been adapted from Kalisch et al [1954] by transforming them 
into 0, 1 normalization. Game 3 was presented to the subjects in what amounted 


to normalized form; game 2 was in an S-equivalent form which concealed the 
symmetry. 


the second game by: 


v({1, 2}) = o({1, 3}) = o(f4, 41) = 72 


ee ay to these respectively as the non-symmetric and symmetric 
Mien o see the exact form of the characteristic functions given to the 
, consult p. 305 of Kalisch et al. [1954]. 

. al of the imputations (normalized) and coalition structures 
—— ose in these experiments is given 1n Tables 12.1 and 12.2. Prob- 
a f, striking feature about these data is the apparent difference 
Osa) oe behavior in the S-equivalent games. Whether or not there 
Wate to . erence is, however, difficult to say. It is by no means ade- 
ok at the two average n-tuples and to state that these exhibit 
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| differences which are beyond exper! 
If we possessed what we wer« ° Wha 
Bie 


the average means. a ay 
equilibrium theory, then we could expect gl O ns whi 
occurred to be predicted by that theory; howev wee 
reason to expect the average imputation to be on feted é 
that theory. Nonetheless, intuitively one senses t! lifeve ‘ 
in the response of the subjects to S-equivalent gan he : . 
tempted to conclude that the subjects did not alwé wii hana 
the matter. Some of the analysis given below sug t the wee 
must be given a somewhat more subtle interpretatio: n this. be 
Let us consider the relation between data and theory for the severa| 
theories which have been offered. “3 
Core. Since these are constant-sum games, the core is empty. 
Solutions. As we suggested by the quotation at the beginning of thi 
section, the experimenters did not know what the von Rfid Waibe: 
a Bet asserts for the experiment, and so no comparison was ie 
y-stability. The prediction of stability theory depends, of cours 
the choice of the boundary condition y. For example, \ - oe, se, upon 
the function defined in section 10.2, i.e., an egaliti Pies hime mp choose 
a player not in the coalition or it ae ‘ + oe a poe adeip 
then the only stable imputation is Sh ee cE em e ory one from i 
case (Table 12.2, games 2 and 3) th apley’s quota. In the symmetric 
(qid}; {2}; BPO ta the fo the only stable coalition structure is 
4) both that structure and ({1 e., metric case (Table 12.1, games 1 and 
without exception these ek 9 js (4}) are stable. We see that almost 
as the imputations are conce oe are not confirmed; however, s0 far 
direction for the games in 0) ied the predictions tend to be in the right 
All this is none too che norm 
Let us, th f Convincing, ho 
p) ere ore, reconsider the 


suggestion that . 

a different functi 
may consider expel] a 
sider adding a mem 


mental error; f 


alization, as is argued in Luce [1955 4): 

wever. 

experimenters’ comments. ‘There's 2 

ing a single . be used, namely: a coalition 

ber who ig 3 er, as before, but it can only con 
ot already in a coalition. This is to $4’ 


a Coalition, 

» Luce [1955 c] has obtained the predic 
rss For clarity, the data have bast 
quilibrium Coalition, and a comparis0" 


Predicti Se 
Ictlons is given. We have required that 
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agnitude of the round-off error i \trod 
normalization. It should be noted Si 
the limitations on the imputations are les 
suggestion. For example, when there are 

redictions are essentially trivial, and so 

further. In the other cases non-trivial pred 

the symmetric game, the data are compatible with 

out of the 9 non-trivial cases. Run 8 of game 3 probably should not be 
interpreted as a failure for the followin 
pair consisting of the quota and ({1 ot 
nature of the experiment this structure could never achieve an imputation, 
since the total payment to the players would be 0. Thus, in practice, the 
only way to achieve an imputation with this coalition structure is for the 
four players to call themselves a coalition, and divide the proceeds accord- 
ing to the quota; this is how they were, in fact, divided. For the non- 
symmetric game there are 13 non-trivial cases (i.e., cases where there are 
no two person coalitions), of which 10 confirm the theory. In one of the 
failures (run 1 of game 1) the error is five percentage points in a prediction 
of 75. In the other two (runs 4 and 7 of game 1), the observed coalition 
structure is stable only if the imputation is the quota, and the data differ 
considerably from that. 

Thus, if we ignore the questionable case of the coalition of four players, 
there were a total of 21 non-trivial cases, of which 18 yield data in agree- 
ment with the theory. Of the three failures, one disagrees by only a small 
amount. In all three cases of failure, the runs involve the S-equivalent 
form of the non-symmetric game, not the 0, 1 normalization. If we are 
willing to accept this theory as accurate, then these results certainly recon- 
firm the belief that the subjects did not fully grasp the logic of the (non- 
symmetric) games presented in S-equivalent form. Furthermore, if 
Tables 12.3 and 12.4 are examined, there appears to be a tendency for a 
Particular coalition structure to appear either in the 0, 1 normalized game 
or in the S-equivalent game, but not in both. This certainly suggests that 
the mode of presentation affected the dynamics of arriving at a stable 
oe and therefore the chance of it arising, but that it had less effect— 

gh still some—on the group decision whether a pair was stable or not 
once it was reached. 
Se outcomes. Only once in the four-person games did a 
ol - much as or more than the bound 6(2). In no cases did the 
restriction f these games lie outside the set L; however, it is quite a weak 
oo — or these two games. In both cases l(S) = 0 ford having 0, de 
Dieses 8 ae iWa =A; » In the symmetric case, /(S) = 4 for S having 
- Inthe non-symmetric case /({1, 2, 3}) = 14, and for all other 


reason: the theory says tnat the 


Cc 
? 15 Totes 1 
2}, {3}, {4}) is stable, but by the 
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licat . (I, 
266 ere Comparison of Results a i Non: 1C Foy, Pp | 
- xpel “Te "7 
TABLE 12.3 Constant-Sum Game (R h VStapi 
Predictions ee / 
Observed Mpatibili, 
Coalition structure | Game Run Imputation een Pi, 
and Corresponding No. | No. 4 bei : lit 
y-Stable Imputations ee 
a ——_—__—_- y > — 
1 foo .40 .30 ‘3 = 0.70 <9 % 
sae A 4s ey 2 |.00 .43 .43 .15/ None Bi 
xg + X32 0 Meee 00 42 42 .17| Non 
xo + X4 ez 0.50 
ert +4 2 0.25 
2h. {1, 3, 4}) 4 1 35).00 .25 38 None 
x1 + x3 > 0.50 4 3 |.29 .00 .46 .25| None 
eee, > 0.25 4 feezo) 00 .42 .29! None 
x3 + x4 > 0.25 
xo = 0.00 
Oo ieaid, 2,4}) 1 6 |.43 .43 .00 .15| None 
*1+ x2 > 0.75 1 8 |.44 .44 .00 .11] None 
%1 + 4 > 0.25 4 4 |.38 .54 .00 .08] None 
x2 + x4 > 0.50 4 eeieoy 53 .00 .10] None 
x3 = 0.00 4 7 |.38 .54 .00 .08] None 
He ee t)) Neca 1 4 |.13 .44 .44 .00) Incompatible 
= a >%2 = 5 1 att 
= 0.25 +, =0 00 7 |.19 .44 .38 .00| Incompatible 
({1, 4}, (2.3 Z a 
ee) 1 3 |.13 .38 .38 .13] None 
ks = 0.75 6 pal} .38 38 bal None 
12) {3, 4}) ae. | a 
Ar X= 0.75 5 |.25 .50 .13 .13] None 
x3 + | 0.25 
The functio ° fd 
NW is described ; a, ue 
2a of non trivial ae in text. In all predictions, the condition * og 
com on 18 required to hold ae 18 omitted ; it was always confirmed. or of 
Putations, Only to the nearest hundredth, the round-off er 
three-perso 


D Coalitj 
+ on: = 
We describeg at the ee me 
& ere 2 
4 Stability analysis—py 
OY i 
a T€atin 


t 
anny OD 
Ly jation 
74. For these games, the ee ditio” 
Ction 11.2 yields exactly the sam° bow’ : 
rovided, of course, the suggesté oat {0 


at d 
ie, be '™portant adv. °! this type, this variation of oh 
’ Olce of 4 antage Over ¥-stabilit that no arbitrary pou! 

p eerequi y thing 


i 
ed. Furthermore, it says so™ 
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i s from the Symmetric Four-Person Con- 
12.4. Comparison of Results om Four-Person Con 
TABLE stant-Sum Game (RAND Experiment) w ith w-Stability 
Predictions 
‘Coalition Structure | Game} Run Ob 
and Corresponding | No. | No. imputation betwee 1 W-Stability 
y-Stable Imputations | ‘eae [Theory and Data 
({t}, 42,35 4H) 2 Rat 20s 20-51 09. | Nane 
st xa > -50 2 | 8 | .00 .30 .43 .28 |Non 
xe + x4 > 50 a 2 | .00 .26 .36 .38 | None 
xg *4 > 50 3 0 SOORS6).3 28:36 | None 
Xi = .00 
({3}, (1, 2, 44 | 3 | 4 | .38 .36 .00 .26 | None 
ni tx > -50 | | | 
xy + x4 2 50 | 
wt x4 > 50 
= .00 | 
1({1, 2, 3}, £43) 2 | 2 | .48 .20 .33 .00 | None | 
x1+ x2 > .50 2 4 .25 .31 .44 .00 | None 
x, + x3 > .50 3 =: .34 .33 .34 .00 | None 
Xo + x3 > .50 ; 
) x,= .00 ) 
| : x 
Minin {2, 3}) Z 1 .45 .13 .38 .05 | None 
| ata = .50 2 | 3 | .19 .19 .31 .31 | None 
) *2+x3;= .50 2 5 942 49 32° 29 PNone 
| 2 6 .28 .19 .31 .23 | None 
(U1, 2}, (3, 44) 3 1 | .25 .25 .25 .25 | None 
. nee - .50 3 7 | .25 .25 .25 .25 | None 
. st x,= .50 
i 
((1, 3 = e 
gi ti {2, 4}) 3 5 | .25 .25 .25 .25 | None 
| “1 x3 = .50 
*2+x4= .50 
<< 
(Ht, 2, 3, 4}) 3 g | .25 .25 .25 .25 | Incompatible 
None 
(see text) 
| ain 
for € function y is described in text. In all predictions, the condition x; > 0 


com nbers of non-trivial coalitions is omitted; it was always confirmed. The 
| of the non is required to hold only to the nearest hundredth, the round-off error 


—= 5 
a 
a =. So 
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restrictions 0 , independent of any es 5 | tions [ 
on (ii) on P- 242], and these may pe sar 1 MOTE com, 
: heir original presentation | i tees 
plicated games. In their Ole ith the upp 1959] 
Kalisch et al. also compared the data with the up} , and the 
found that in most of the experimental runs at least ceived mor 
than d(S). They concluded that « . + - the functi “ems to have 
no relation with the way the game was actually pla 52, p.27 
Value. Although there is no particular reason ( ct any specif 
equilibrium outcome to be the Shapley value, one might argue, much a 
we did when introducing that concept, that it should predict the average 
equilibrium outcome. If so, then it makes sense to compare the average 
imputation with the value. In the symmetric game the value and the 


quota are identical. It will be seen from Table 12.2 that the average 
imputation for the game presented in 0, 1 normalization (game 3) is 
quite close to the value, whereas the average for the S-equivalent game 1s 
not. For the non-symmetric game, the reverse pattern scems to be true. 
Actually, the value is not a bad indicator of a player’s expectation when we 
ees over the 16 cases of each game, without regard to the 
ee ee of the characteristic function. However, i 
the dynamics of ied ae a ae gt a pai oe 
Bia Guornes coc . a not affecting the existence ip 
expect the value to predict the at ae a a.’ ae at 

age imputation. For, by vary!ns the 


mode of i 
Presentation, we should be able to change the probability ot 


various equilibri i 
; rlum imputation ’ 

; i u : 
imputation, Ons occurring, and thus change the average 


from this experiment? This is difficult © 
b itis a pat the results do not coincide exactly with 
y the experimental tech question how much the outcome was influence 

i chnique. One senses from the author’s commen!s 

y felt, and that seems to be an ideal w?) 
g all knowing—a basic assumption of the 


: ye 
€trical obstacles to coalition format!” 4 


Furthermore, the geom 

ai henom 

stability theory by eo ae 
ans of 


P oO Sames. : 
imputation ‘ ne would conclud As far as confirmation ° 
may b 0€8 lie in the ude from these d equilibt 

€ the sets B ata that an €9 pict 


resul and 3 js 
t of €xperim i. Also, with a few exceptions V ‘as! 
ndav? 


¥ such ent 
t al techni 
hat the outc chnique, there does exist 2 bo ere 


r : Ome 
functions Which w S are ¥-stable pairs. Of courses ti 
ould do better, At least for this * 
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ment, 
tion, but we 


the value seems to be an adequate p 
doubt that this is a general propositio1 Che stability analy- 


js certainly suggests that it is reasonable to treat the outcome as a pait 
consisting of an imputation and a coalition structure, since the data 
become quite coherent when grouped according to coalition structures, 


Possibly the most significant fact suggested by this experiment, and one 
we expect to be generally true, is that subjects not only respond to the 
strategic aspects of the characteristic function but they are also influenced 
by its mode of presentation. This is almost certainly true for the dynamics 
of coalition formation, and there is some indication that it may be true for 
the equilibrium behavior of players. 


12.4 ARE ““REAL”’ GAMES EVER “‘ABSTRACT”’ GAMES? 


It is trivial to create an experimental situation which satisfies the rules 
of an extensive game (see section 3.3), but this does not ensure that it will 
be a game in extensive form, as we took pains to point out in section 3.5. 
Three conditions beyond the rules were added which must be met before 
itis a game. These were interpreted as describing the players: each 
player has a utility function over the set of lotteries generated from the 
outcomes, each attempts to maximize his own expected utility, and each is 
assumed to know the extensive game in full—in particular, to know all 
of the utility functions. If the game is taken in normal form, these 
assumptions remain the same except that the players are each assumed to 
Know the structure of the normal form in full, i.e., each knows the strategy 
“ls and the payoff functions of all the players. 

Since we interpreted the maximization principle in such a way that it 
ae etlly true, the first two assumptions are both verified simply by 

Ing that a utility function exists for each player. Certainly, at 

a. it cannot be claimed that this has been shown to be true (even 
aed in a wide variety of situations, but there is some slight 

tituations rom simple experiments on the utility of money in gambling 

Atleast. pceeating that it may not be totally unrealistic (see section 2.8). 
‘ah ere is the possibility that such functions exist, which leaves the 


‘Q—; 


hin the knowledge—assumption to be considered. Possibly it is met 


, “in extremely simple situations, but in any experiment of significant 
ity or in any situation occurring in life we seriously doubt that this 
tion is tenable, even as a first approximation. If that be so, then 
ced to admit that the answer to our section heading is No—that 
esting cases of conflict of interest are not in fact games either in 
Normal form. 
mitted this, the question arises as to what they are. There 
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= to be two conceptually — proba 
— a realistic—say$ that each hat wean 
= utility functions of the others, # a , reat hi 
of decision making under unc is point ¢ 
> agen as me extent in the next éoeas 


view will be explored to so 


‘nto the general problem of individual decision der unger. 


tainty. The other suggestion attempts to _ | h heory rede 
work slightly in such a way as to weaken the know aug : imptions, but 
at the same time, to continue to utilize some of u malism of game 
theory. Possibly the most important feature of this ge! ralization isthe 
technique it suggests for overcoming some of the difficulties in finding 
characteristic functions. However, the generalization is full of weak. 


nesses. In addition to those of its own, It has most of the shortcomings of 
game theory: there are no suitable sociological assumptions in the underly. 
ing structure, and it supposes the existence of a transferable utility (see 
sections 7.7 and 10.4). Be that as it may, let us examine the idea briefly; 
for a fuller statement see Luce and Adams [1956]. 

Although each player may not correctly perceive another player's pay- 
off function, it is still conceivable that he will behave as if he postulates 
utility functions for each of the other players which he “‘believes” they 
are trying to maximize. This we shall assume. Thus, to each playet! 
a will be associated n payoff functions M SS ar M; 
anc 2 pre on 
inci... oe (when in fact it is M;;’), and M; i: 
ghange, the model remain. ee pting to maximize. Except for! * 

€ same: each player has a set Si of Pit 


strategies, and th 
€ others know thi acuee 
Own payoff function MX. this, and each attempts to maxim! 


pris Participating j : The only difference is that each player thinks 
game with the wey fmt ame, €.g., player i thinks he is in the 
eerne with, payoff functions M;}, M2... - ; a, ES jin 
Payoff: al Be: eva 
structure is rae S M; ’ M;, ee : M J a n etc. Such 4 


when Mj = Mi 


: it 18 
om the most general possible, and “ 
gam ©xperimental game-like situations ae " 
es, Another generalization whic ao 

0 


1S to suppose that the playe™$ © fo 
i ategy sets. Such must be the ca 
ts is ai Competition, , 
Secret he dely €s ” Shiela 


>in mu €r’s str 


where research may © vel’ 
os Producer. By keeping such otil 
€r players as to his strategy 
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42.4] x . ; 
another form of erroneous ——!, seems common. layer j may have 
; perception of 7’s perception of k s utility funct ion W nich is, in fact, dif - 
ferent from eS perception of k’s utility functio: Of course, such mis- 
erceptions of misperceptions can be carried as ee steps as one chooses, 


put with little likelihood of profit. Indeed, whether it is valuable to go 
from a game to an m-game is debatable; certainly it has yet to be con- 


clusively shown. 

But to continue with the idea, since each player 7 believes he is in a game 
with payoffs M,’, a characteristic function 0; can be computed; this is 
called player 7's subjective characteristic function. From the objective game 
mvs, °° M,,* an objective characteristic function v can also be 
computed, but it appears to be of less interest. Clearly, if the m-game is 
in fact a game, 0; = 2 for all i. There are now two questions to be con- 
sidered, one theoretical and the other practical: What sort of theoretical 
superstructure can be raised on n subjective characteristic functions, and 
in what way is it possible to determine these subjective functions? 

The question of a theory is far from adequately handled by Luce and 
Adams, and there seems little point is reproducing their discussion here 
except to say that they attempt to reduce the structure once again to a 
single set function. As they recognize, their attempt is unsatisfactory 
because it rests on an ad hoc interpersonal comparison of utilities. 

Of more interest is their idea for dealing with the practical problem, 
which is, of course, of some magnitude. Determining a characteristic 
function was a serious problem when we had only one in game theory, 
and n of them surely does not make it easier. In principle, the solution 
exists: from each person find not only his own preference pattern, but also 
his beliefs as to the patterns of the others. From these construct the sub- 
jective payoff functions and then solve the necessary two-person zero-sum 
ames to get the subjective characteristic functions. But this is simply 
Not feasible. 

ne the Context of m-games it may make sense to try to determine 

a, a functions directly sythout passing through the normal 
Blesences i €a is almost trivially simple. The subject is to report his 
ing EE oiiona paired comparisons between coalitions and lotteries involv- 

; ns (including those coalitions he would not actually be in). 
ae meet certain consistency requirements, then a charac- 
is plausibi a can be constructed which, for reasons that will be given, 
let the eee ree 2 his subjective characteristic function. ; We could 

Which must eee : be an undefined primitive, as In utility theory, 

Ver, since eo a suitable realization by the experimenter. How- 

on Se are fairly specific alternatives, let us try to spell out 
e subject to have in mind when he says that he prefers one 
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is another. We would instruct him | ) | 1€ Moment 
ee 4 he is. Rather he is to approach we t structure 
which bi on the basis of his choices, \ gned to , 
as an outsider V 3 When deciding bet ‘time be 4 


the situation. +e IS 
t he would be placed randomly 1n ob € roles of 


Thus, he is to decide w wuld like to 


player role in 
to imagine tha 
the coalition he chooses. 


be an “average” member of one coalition or the | a lottery, a 
chance device with known probabilities will deci alition he ig 
to imagine he is in, and, as before, his role fee the payoff within 
that coalition will be randomly Mecided. “A mathem: result is given 
below which argues for this particular interpretation O: | ference. 


In whatever way we attempt to realize the primitive idea of preference, 
we shall suppose that it satisfies the several axioms of Chapter 2 which 
lead to a linear utility function. In addition we shall impose another 
axiom, one which makes a certain amount of sense for coalitions, though in 
general it is not meaningful. Consider two disjoint coalitions R and § 
with |R| and |S| members, respectively. We shall assume that RUS is 
preferred or indifferent to the lottery in which R arises with probability 
IR\/(R| + |S|) and S with probability |S|/(|R| + |S|). We may argue for 
this condition as follows: The probability of taking a particular player 
role in the two alternatives is exactly the same, since in the coalition RUS 
it is 1/(|R| + |S|) and in the gamble the probability of being any member 
of R is IR|/(|R| + |S) -1/|R| = 1/(|R| + |S) ai of being any member 
=e is |S|/(|R| + ISI) -1/|S| = 1/({R| + |S|). However, just as in game 
el eg ee ilitcs of RUS are never inferior to those of the 
particular player are “ie S, so, given that the probabilities of ae? 
Petit Rs. ame in both cases, he should never prefer 1 
Fro a 
know oe Sala preference meets the utility axioms, ‘! 
unique up to a Piha “eee by a linear utility function « which a 
ear transformation. From this funct!o? we 


generate a wh ‘ 
Beil nox ole Class of functions defined over the coalitions of the game 
1s v, where for all coalitions S 


(8) = elSifu(s) ~ u(ay) + J as 
and where ¢ is a hes 


Positive c con" 
generated as we let ¢ , 
9 @1, 42, ° +3 


ble do ; 
mat : di 
Ms. This class of functions has these pertine 


Over their Possi 
Properties: 


> 


Are “Real”? Games 
42.4] 


4, Any two members in the class are 
fic function §-equivalent to a mem! er of 
other words, a given utility function u ger 
of characteristic functions. | 

ii. Any positive linear transformation of 
erates exactly the same equivalence. class « 


does wu. 


Summarizing these three points: if a subject’s preferences among lot- 
teries of coalitions satisfy the axioms we have assumed they do, then an 
S-equivalence class of characteristic functions is naturally associated with 
his preference relation. ‘The suggestion is that these functions be inter- 
preted as the class of characteristic functions S-equivalent to his subjective 
characteristic function. One strong argument for doing so is the following. 

Suppose that a person actually does have a subjective characteristic 
function v for the given situation; this could be the objective characteristic 
function of the game, or it could be his calculation of his subjective char- 
acteristic function, or it could be arbitrarily given. It does not matter 
so long as he knows it numerically. Suppose that he is placed in the 
experiment described above and that he proposes to use v as best he can 


to arrive at his decisions. If his choice is between two coalitions R and S, 


and if he is to be randomly placed in the role of one of the players, a 
plausible index for comparing R and S is 


o(R) 2(S) 
VETSUS, Boar: 


iR| |S| 


. a lottery in which R occurs with probability and S with probability 


sa p, a plausible index is the expected value of the index for each coali- 
10n separately, i.e., 


v(R) +a v(S) 

| PRI P15 

a actually use this index to determine his answers, then it can be 

ave ae the resulting preference relation must satisfy all the axioms we 

a, and that the S-equivalence class of characteristic functions 

fen y the above scheme includes the given characteristic function v. 

a ae objection to this last result is the olecmation that, had 

Preference a v but rather some S-equivalent v’, in general the 

easily ¢ ame ation would be different. Indeed it would, but one can 

same . Se that the two different patterns must generate the 
It ; ence class of characteristic functions. 


Is . . . 
hot known at present whether this technique 1s experimentally 
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realizable and, if it is, whet 
d here are, it is true, spe 


her the axioms are m™ a 
cial cases of the al ernatives 
hey also possess certain peculiar feat n lead one 
tility axioms hold. These a he 
Juate dispassionate! h the coalition 


involve 
Chapter 2, but t 
to doubt whether the u 
complicated: a person must eva 


he is in and those he is not in, he must imagine what it is like to be an 
“average” member of each of these coalitions, and he must consider 
° ee . s = tax 12 1 th ane . . 
lotteries having coalitions as prizes. This seems so taxing ot he imagina. 
tion that one can fairly doubt that he will be consistent in the sense of the 


utility axioms. Furthermore, the added axiom, which ensures super. 
additivity, is also suspect. To be sure, the coalition RUS should be pre- 
ferred to the lottery when all of the players are thought to be rational, for 
it has the greater strategic potentialities; but, when the evaluations are 
obtained as suggested, it may well happen that a person will prefer the 
lottery on the grounds that effective cooperation in a larger group is more 
difficult to achieve than in a smaller one. 

Assuming, however, that the technique is feasible, it would certainly be 
interesting to know what pattern of coalition preferences the subjects would 
have in an experiment similar to the one described in the last section, and 
in particular whether they would faithfully reproduce the monetary 
worths of the coalitions. There is reason to suppose that utility for mon?) 
a. 0 with money, but in all likelihood this would be a small effect. 
ie Bead Sa leaies ih would arise from the failure of the alae 
acteristic function is a Ps * “aad . — ia ae ne 
that the experimenters Sei ; : oo onl y  . cal 
impressed by large monetar 4  — that their subjects were any 
incremental effects. If oe ee Paid estes Regard to = an 
subjective characteristic fun . eee? we might expect © — ‘ve 

ctions somewhat at variance with the object 
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INDIVIDUAL DECISION MAKING 
UNDER UNCERTAINTY 


13.1 INTRODUCTION AND STATEMENT OF PROBLEM 


Possibly the best way t 


where we discussed the 


whether it is by a group or an individual and a ccording to wh 


an inaiviaual anc at “aS 


Oeing carried out under conditions of certainty, risk, or uncert 


the ten intervening chapters we have been 


| lividual 
decision making in a very particular context of uncertainty known as a 
game. : 


‘ Ina game the uncertainty is due entirely to the unknown deci- 
“tons of the other players, and, in the model, the degree of uncertainty is 
reduced through the assumption that each player knows the desires of the 
other players and the assumption that they will each take whatever actions 
“Ppear to gain their ends. Traditionally, the game model is not called 
*cision making under uncertainty; that title is reserved for another 
Pecial class of problems which lie in the domain of uncertainty. These 
Se elgaia we shall discuss presently, have for the most part grown 
Rice oa — in the gaan = = —— much 
‘ E < < 7 Cc 
@propriate a - experimental evidence and in drawing 
275 
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problem is sim 


S Ai; Ag, 


ple to state. 


The gist of the Ams but the 1 


ne a set of act ail 
ee cpents upon which ‘‘state of nature’ pr¢ 
ac 


The term ‘‘state of nature” will be more pally exp 

the idea is intuitively clear. As the =? ‘ 

one of several possible things is true; ae 1 ae | ye. 
choice, but we do not even know the relative pt 4 i seth 
or, indeed, if it is even meaningful to talk about | oe dee 
which one obtains. A simple example will illustr: dilemuncan 
one is due to Savage [1954]: 


Your wife has just broken five good eggs into a bowl when you come in and 
volunteer to finish making the omelet. A sixth egg, which for some reason must 
be either used for the omelet or wasted altogether, lies unbroken beside the bow! 


You must decide what to do with this unbroken egg. Perhaps it is not too great 
an oversimplification to say that you must decide among three acts only, namely, 
to break it into the bowl containing the other five, to break it into a saucer for 
inspection, or to throw it away without inspection. Depending on the state ol 


the egg, each of those three acts will have some consequence of concern to you, sa) 
that indicated by Table 13.1. 


TABLE 13.1 
State 
Act ea “ 
Good Rotte 
Break i “ otten 
into bowl Six-egg omelet No omelet, and five good e888 
Break into : destroyed 
saucer Siege omelet and a saucer Five-egg omelet and a saucer 
o wa ai 
Throw away a to wash 


Five-egg omelet, and one 


Five-egg omelet 
good egg destroyed ee 


In general, to each pair (A 


; ei: - ie alia 
will be a consequence or 6 i, Sj), Consisting of an act and a state, th 
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mong hypothetical lotteries with i 
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netion (see Chapter 2). If we arbitrarily choo® 
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| mutually exclusive and exhaustive list 
| e relevant to this particular choice p 
ar 
gon maker . 

fren there is a natural enumeration ol th 
0 


is uncertain. Although t 


world in particular contexts. We assi 
the world which is unknown to the decision mak 


TABLE 13.2 


States 
Acts Si S92 et SS § 
Ay uj u\2 ujj Uin 
2 uo, UuU22 uo; u2n 
Ai Ti Ue A Re i 
Am Ae tim? ee ae ema SY Umn 


One extreme possibility we know how to treat—namely, risk. In that 
case a probability distribution over the set of states is known—or, better 
yet, the decision maker deems it suitable to act as if it were known. For 
example, suppose in the omelet problem described above, the husband—a 
scientifically minded farmer—‘‘knows” that in a random sample of six 
eggs the conditional probability of the sixth egg’s being rotten when the 
other five are good is 0.008. Thus, he may view breaking the sixth egg 
into the bowl as the lottery: 0.992 probability of the six-egg-omelet prize 
and 0,008 probability of the no-omelet-and-five-good-eggs-destroyed prize. 
In other words, an a priori probability distribution over the states “‘good” 
and “rotten” allows one to structure the problem as one of decision making 
under risk—as a choice among lotteries. 

In general, if an a priorz probability distribution over the states of nature 
exists, or is assumed as meaningful by the decision maker, then the problem 
fan be transformed into the domain of decision making under risk. In 
Particular, if the probabilities of states 51, 52, °° ° > Sy ALE Phe, ~* oP as 


respectively, (where u 1 oan DP 0), then the utility index for act A; 
isi j=1 

ha CxPected utility, ie., usps + wiapa F * °F tiaPa: Ee act having 

om utility index is chosen, and we say that this act is “‘best 

| oe , the given a priori probability distribution.” (Equivalently, we 

think of the decision problem as a game: the decision maker is player 
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- 5 1 } obaDdD 1Dution ) 
oe i Ma j.e., against the given a priori pro! | ibution, ) 
mixed strategy; i “sumption leads us to a proble: ve already 
xtrem oR dy 
‘am aa tail. Let us, therefore, turn to the othe: me in which 
i © ; ae s. 
. as t the decision maker is ‘completely ign: as to which 
a a 
cst re prevails. ‘This phrase “completely ignorant” is vague, we 
state of natu : ; F att a 
. controversy Che vaguenese 
know, and it has led to much philosophical gueness 


will be considerably diminished when later we attempt a cope axio- 
matically with decision making under uncertainty; howev an perhaps it 
can now be reduced some by an illustration. Let us again examine the 
omelet problem, but with the cast changed. Instead of a scientific 
farmer, suppose the omelet is completed by a city boy unaccustomed to the 
ways of eggs. Furthermore, assume that the five eggs already broken 
were white, whereas the sixth is speckled brown and (to the city boy!) of 
unusual size. He doesn’t have the faintest idea what to expect, having 
had no previous experience in matters of this kind. Nonetheless, he must 
make a decision, which leads to the question of criteria for decision making 
when the states are completely uncertain. 


13.2 SOME DECISION CRITERIA 


We shall now list, but only partially discuss, certain criteria which have 


La offered to resolve the decision problem under uncertainty, which we 
shall abbreviate as d.p.u.u. A criterion is well-defined if and only 
Prescribes a precise algorithm which, for any d. p. u. u., unambiguously 


selects the act(s) which j ing 

; Is (are) taut i “opti ccording 

to the criterion,” (are) ologically termed ‘“‘optimal a 

F a each o the following criteria we shall 
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In this case, the security level for each act is 0, but if we permit randomiza- 


tion between A; and A» the security level can be raised to 4 by using 
(4A, V4Ao). This is the hedging principle discussed in section has ue 
is suggested that the reader review section 4.10, which dealt with the 
appropriateness and interpretation of a randomized strategy (act). 

The maximin principle can be given another interpretation which, 
although often misleading in our opinion, is sufficiently prevalent to 
warrant some comment. According to this view the decision problem is 
a two-person zero-sum game where the decision maker plays against a 
diabolical Miss Nature.! The maximin strategy is then a best retort 
against nature’s minimax strategy, i.e., against the “least favorable” a 
priori distribution nature can employ. We recall that in a two-person 
zero-sum game the maximin strategy makes good sense from various 
points of view: it maximizes 1’s security level; and it is good against player 
?’s minimax strategy, which there is reason to suspect 2 will employ since 
it optimizes his security level and, in turn, it is good against 1’s maximin 
strategy. In a game against nature, however, such a cyclical reinforcing 
effect is completely lacking. 

Nonetheless, just because a close conceptual parallelism between a 
d. p. u. u. and a zero-sum game is lacking, it does not follow that the 
maximin procedure is not a wise criterion to adopt. It has the merit that 
it is extremely conservative in a context where conservatism might make 
good sense. We will have more to say about this later. 

(It is customary in the literature to consider negative utility, disutility, 
or loss, as an index appraising consequences. With that orientation the 
decision maker, therefore, attempts to minimize the maximum loss he 
Tuns from adopting an act—i.e., he “minimaxes” instead of “‘maximining.”’ 
Consequently, the principle described above is usually called the minimax 
principle.) 


The following simple example exhibits a possible objection to the maxi- 
min principle: 


Sj 52 
A,|0 eh 
ya ful 


1 ate 
N In a recent lecture to statisticians one of the authors spoke of “diabolical Mr. 


a ate 
a ture.” The audience reaction was so antagonistic that we have elected the path of 
ast resistance, 
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have security levels of 0 and 1 res} | 


‘ ‘nin criterion. This re 
oil a Pd. Some consider th ble. Set. 
domized acts are consi e a ee cist tha | Bee ; 
emphasize their objection Be 0.00001 would si 
select Ag even if the 1 were reduced to V-¥ (ee increased 
10°, These critics agree that act Ae Is reasona eer 
scious adversary of 1, for then 2 should choose 1, ani ~ agains 
but, they emphasize, nature does not behave in t! nd if iden 
completely ignorant about the true state of nature, | y claim Ay i 
manifestly better. 

The minimax risk criterion (suggested by 
improvement over the maximin (utility) criterion). ‘This criterion can 
be suggested by continuing the analysis of the above d. p. u. u. If 5; is 
the true state, then we have no “risk” or “regret’’ if we choose Ag, but 
some “risk” if we choose Aj; if so is the true state, then we have no risk if 
we choose A; and a good deal of risk if we choose A». Schematically: 


Since A1 and A2 


Ga. 40047 
Wa) [1951] as an 


Utility Payoffs ‘‘Risk’’ Payoffs 
Sy S9Q S1 S52 
a8 100] > 4i[1 0 
A2|1 1 aa O99 |. 
Cenas: 
a an of risk” payoffs, A; has a possible maximum risk of 1, whereas 
- as a possible maximum risk of 99. Consequently, A; minimizes the 
axi : are 
mum risk. However, if randomization is permitted, neither A, nol 
Ag is optimal. 
The general procedure goes as follows: 
i. Toad Pp. u. u. wi +): i 
» Pp. U. U. with utilit i - : ith risk 
payoffs r;;, where r,; y entries u;;, associate a new table with ™ 


j 18 defined as the amount that has to be added to “i 


to equal the maximy a 
re m util . ‘ 
il. Choose that act ia, payoff in the jth column. 


act. minimizes the maximum risk index for each 
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particular, 


then, the arrays 


should be strategically equivalent for any 4 
_{ and b equal to —100, we get 


— 1 0| 
0 —99 |, 


which is the negative of the risk payoff array. Therefore, the maximin 
criterion for this payoff array is the same as the minimax criterion for the 
risk array. 

In criticism of this proposal, we quote from Chernoff [1954]: 


Unfortunately, the minimax regret [risk] criterion has several drawbacks. 
First, it has never been clearly demonstrated that differences in utility do in fact 
measure what one may call regret [risk]. In other words, it is not clear that the 
“regret” of going from a state of utility 5 to a state of utility 3 is equivalent in some 
sense to that of going from a state of utility 11 to one of utility 9. Secondly, one 
may construct examples where an arbitrarily small advantage in one state of 
nature outweighs a considerable advantage in another state. Such examples tend 
to produce the same feelings of uneasiness which led many to object to the [maxi- 
min utility] criterion. 

A third objection which the author considers very serious is the following. In 
some examples the minimax regret criterion may select a strategy [act] Az among 
the available strategies? A1, Ao, As, and A4. On the other hand, if for some reason 
Ay is made unavailable, the minimax regret criterion will select Az among Aj, Ao, 
and As. The author feels that for a reasonable criterion the presence of an unde- 


sirable strategy A, should not have an influence on the choice among the remaining 
strategies, 


Chernoff’s third objection to the minimax risk principle is a variation on 
our old theme of the “independence of irrelevant alternatives.” There is 
an obvious modification of the minimax risk principle which copes with the 
Problem of non-independence of irrelevant alternatives—but, unfor- 
tunately, it has its own, more serious fault. Roughly, the idea is: instead 
of Comparing an act with all others to ascertain the risk, which introduces 
the difficulties when new acts are added, simply make paired comparisons 

tween acts. Relative to the universe of any two acts, and for each state, 
nal the risk of taking each act. Of the two acts, choose the one 
i ae risk isleast. An optimal act is then defined as one which 

referred or indifferent, when compared in this way, to every other act. 

'S procedure is unsatisfactory because there are d. p. u. u.’s in which 


2 
Chernoff uses letters dj, do, d3, and d4. 
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intransitivities occur, and so for these a " ‘5 s to i 
ous optimal act. An example is the ¢. p. U- 0. 
Si 52 53 
a 4 
Hd oe 10, 4 (payoff in utility 
Ae 5 2 10 
The procedure outlined yields the following: 
(i) Ay over Ae for: A1 has a maximum risk of 5 (from s2) whereas 4, 


has a maximum risk of 10 (from s1). 

(ii) A» over Ag for: Az has a maximum risk of 6 (from s3) whereas 4, 
has a maximum risk of 8 (from s2). 

(iii) A3 over A; for: A3 has a maximum risk of 5 (from s1) whereas 4, 
has a maximum risk of 9 (from s3). 


Consequently, none of the three acts can be optimal since each is less pre- 
ferred (in a paired comparison) than one of the others. 

This same example also illustrates Chernoff’s third objection to the 
minimax risk criterion. Restricting ourselves to acts Ay and A, that 


criterion selects A» as optimal and A3 as non-optimal. When A, is added, 
the risk matrix is 


Oe so) SS 
An 0) “5° 9 
Ao|10 0 6 (payoff in risk units) 
Aaa 5 8 0 


and As is then optimal since its max 
maximum risks, 
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way example, in the class: 
us; for examp e, in the Class: 


see what seems reasonable in certain simple cla 


S51 S52 
Ai|0 1 (utility payoff). 
Ao x x 
The a-indices of A; and A» are 1 — a and x respectively. Consequently, 


a 


if one can choose an x such that A; and A: are indifferent, then one can 
impute an a-level to oneself. For example, if A; and Ag are indifferent 


419 


we 


for x = 34, then a must be 9g. Thus, by resolving a simple decision 
problem an a-level can be chosen empirically, which, in turn, can be 
employed in more complicated decisions. 

But there are also objections to this criterion; one may be illustrated 


by the following example: 


Sts? 53 
gee Oe AL oO) 
Ben 0s 0 (utility payoff). 
YOAi, Ar) | 1 0 


Suppose the a-level of 14 is chosen. The a-indices of A; and A: are each 
Y-0+ (1 — 1%) -1 = 34, whereas the index of (14441, 4A) is 14-0 + 
(1 — 4) - 144 = 3g. Consequently, although A; and A: are each opti- 
mal, the procedure of tossing a fair coin and taking A if heads and Ag if 
tails is not optimal. Critics of the Hurwicz criterion claim that any 
randomization over optimal acts (according to a particular criterion) 
should itself also be optimal according to that criterion. Remember that 
a randomization which uses only optimal acts will ultimately cause the 
decision maker to adopt one of these optimal acts! 

A second possible criticism of the Hurwicz criterion is that it resolves the 
following d. p. u. u. counter to one’s best intuitive judgment: 


ie son ea SLOO 

ah ‘laa client gi ti *| 

eye el ght lam al 0 a ec | 
ne to any a-level Hurwicz criterion, both acts A, and Ag have an 
is | wae and so they are considered indifferent; however, if one 
ia E etely ignorant” concerning which is the true state, then, the 
is A, fe, A, is manifestly better than Az. But, in defense of Hurwicz, 
ines ‘= y better than A»? What seems to be implied here is that the 
ate is “more likely” to be one of the states 52 to 5100 than 5s}. This, 
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to each act A; assign its expected utility index, 
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and choose the act with the largest index. : ones 
At this juncture, it would be apropos to digress into the philosop ak 
foundations of probability and to review the special role of the it A ‘ 
insufficient reason in relation to these foundations. But we shall ee 
this temptation, for to do the topic justice would require a sizable ss 
sion, and there are already excellent expository accounts of this ae ek 
(See, for instance, Arrow [1951 b], Nagel [1939], and Savage [1954); “- 
of these references, in turn, gives a relatively complete bibliograpny:) 
We will confine ourselves to a few simple.remarks. cael 
The principle of insufficient reason, first formulated by Jacob Ber a ‘ 
(1654-1705), states in boldest terms that, if there is no evidence ‘Sead 
one to believe that one event from an exhaustive set of mutually i, 
events is more likely to occur than another, then the events oo “ 
judged equally probable. This principle is extremely vague, 6 
indiscriminate use has led to many nonsensical results. Writers 5! 


. . and 
Bernoulli’s time have attempted to add qualifications to the pe 
to specify limited interpretations so as to avoid some of the more bial 
contradictions, 


From an empiric 


$ z ° le is 
al point of view, one difficulty with the prin'P 
1s: Suppose we ar 


4 aking 
€ confronted with a real problem in decision Se od 
oa €n our first task is to give a mutually exclusive is suc 
laustive listing of the Possible states ofnature. The rub is that — cant 
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a €m will, when r Principle of insufficient reas? 
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re organism remains fixed; 
have: 21 the orge ay 

ually good listing we might have 
anism moves to the left; ss, the 

sther complicate our description of 

_ «ng which leg first moves, whether the an ead ¢ , 


There is a counterargument to Be 


hat there are various acceptable interp 


que th 
seis 3 state in a given real problem, it is not true that we will feel that 
the states are “equally likely” in each interpretation. In other words, 
care must be exerted in the choice of states if one wishes to use this princi- 
sie. As it stands, this defense is weak in that there is a crying need for an 
empirical clarification of the term “‘equally likely.” Eventually, we shall 
examine two suggested clarifications. The first, an axiomatic treatment 
due to Chernoff [1954], characterizes his notion of “complete ignorance”’ 
‘a such a manner as to justify logically the principle of insufficient reason. 
This will be described in section 13.4. In the second, the equally likely 
assignment gains empirical meaning through the ‘‘practical”’ suggestions 
for probability assignments offered by the personalistic school of proba- 
bility (see section 13.5). 

Incidentally, the arguments against the principle of insufficient reason 
become even more cogent when there are an infinite set of pertinent states 
of nature, for then it is difficult to single out a natural parametrization, or 
enumeration, of the states tor which a suitable generalization of the 
“equally likely” criterion is appropriate. 

Before we turn to the axiomatic studies of decision criteria, what of the 
poor decision maker who is now totally confused by the pros and cons of 
the above criteria? Can he, in desperation, compromise by adopting 
some sort of arbitrary composite of the criteria? Subsequently, we will 
Suggest some plausible composites; however, for the present, the following 
example must be included as a note of caution, for some apparently accepta- 

le compromises may not be so acceptable after all. 

Take the case of a decision maker who cannot crystallize his preferences 
=a the maximin criterion, the Hurwicz criterion with a = 34, and 

Principle of insufficient reason. He thus decides to define one act as 
Preferable to another if arid only if a majority of these three criteria 
Fegister this preference. The following d, p. U. Us establishes that this 
“ompromise procedure is not well defined: 
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riteria select A, over A2; 

an intransitivity- The majority decision pru 
fare contexts (Chapter 14) leads to the same €1 
of preference. The reasons are analogous 


A majority of the c 


13.3 AXIOMATIC TREATMENT: THE AXIOMS NO RRING 1 
“GOMPLETE IGNORANCE” 


Instead of applying specific proposed decision criteria to carefull 


selected decision problems, thereby determining whether or not each 
criterion complies with our ‘ntuitive criteria (which we deem to be reason- 
able), let us, as so often before, invert the procedure. Let us cull fron 


our intuitions certain reasonable desiderata for decision criteria to fultil, 
which we can then investigate both as to compatibility with one another 
and as to their logical implications. Our axiomatic presentation maitl) 
ale a- but it is also a curious mixture of the works a! 
5s eee [1951 a], Savage [1954], Arrow [1953], and 
published comments by Rubin. 
There are two distinct types of axiomatic approaches in the literatur®: 
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1 A’ > A’: means that A’ weakly dominates A" in the sense that {’ is 
ill. ~ 4 M 
eferred to A” for at least one state and is pre 
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for all other states. 


Since any d, p. u. u. is characterized by a clas 
of nature Ss and a utility function u, we may sym volically identifs 
d, p. u. U- with the triple (@, S,u). A decision criterion associates to each 
7 e., to each (@, S, u), a subset @ of @; the acts in @ are called 
d. p. U.. u., 3 b] ’ 
optimal for (@, S, u) relative to the given criterion. @ is called the choice 
or optimal set. 


Desiderata for criteria 


Axiom 1. For any d. p. u. u. (Q, S, u), the set @ is non-empty, 1.€., every 
problem can be resolved. 

Axiom 2. The choice set for d. p. u. u. does not depend upon the choice of 
origin and unit of the utility scale used to abstract the problem. 

Axiom 3. The choice set is invariant under the labeling of acts, i.e., the real 
acts singled out as optimal should not depend upon the arbitrary labeling .of acts 
used to abstract the problem. 

Axiom 4. Jf A’ belongs to @and A” = A’ or A” ~ A’, then Al" belongs to G. 


Axioms 1 through 4 are quite innocuous in the sense that, if a person 
takes serious issue with them, then we would contend that he is not really 
attuned to the problem we have in mind. 

Anact A’ is said to be admissible if there is no act A in @ such that A zm A’. 
ie., A’ is admissible if A’ is not weakly dominated by any other act. 


Axiom 5. If A’ belongs to G, then A’ is admissible. 


Axiom 5 is equivalent to: 


Given A’, if there exists an A such that A 7 A'( that is, if A’ is not admissible), 
then A’ does not belong to @. 


It should be noted that as they were originally stated neither the maxi- 
Min principle nor the Hurwicz a-criteria satisfy axiom 5; however, both 
_ be appropriately modified in a trivial manner. To see the problem, 

nsider the following d. p. u. u.: 
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iom 6. Adding new acts | 
se t, has no effect on the of timali 


by or is equivalent to some old ac 
of an old act. 


Example. A gentleman wandering in a strang liner dine 
chances upon a modest restaurant which he enters ainly. The 
waiter informs him that there is no menu, but that this evening he may 
have either broiled salmon at $2.50 or steak at $4.00. In a first-rate 


restaurant his choice would have been steak, but considering his unknown 
surroundings and the different prices he elects the salmon. Soon after 
the waiter returns from the kitchen, apologizes profusely, blaming the 
uncommunicative chef for omitting to tell him that fried snails and frog’s 
legs are also on the bill of fare at $4.50 each. It so happens that our hero 
detests them both and would always select salmon in preference to either, 
yet his response is “Splendid, I’ll change my order to steak.” Clearly, 
this violates the seemingly plausible axiom 6. Yet can we really argue 
that he is acting unreasonably? He, like most of us, has concluded from 
Slee cx perience that only “good” restaurants are likely to serve snails 
Be dine new acts oo assumption implicit in axiom 6, namely, 
eo oad. p. u. u. does not alter one’s a priori information ® 

fo which 1s the true state of nature. In what follows . se that 
this proviso is satisfied. In ae * at follows, we shall suppose i 
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pocToR: That’s true, isn’tit? In tha 
nurRSE: Please repeat that! 


The example given at the end of th 
criterion shows that axiom 7 rules out the 

Note that axiom 7 does not prevent an opt 
into a non-optimal one by adding new acts; thi: 


new acts is optimal. Therefore, one might wish to strengther ) 


Axiom 7’. The addition of new acts does not transform an old, originally non- 
optimal act into an optimal one, and it can change an old, originally optimal act into 
anon-optimal one only if at least one of the new acts is optimal. 


A further strengthening of axiom 7 is: 


Axiom 7”. The addition of new acts toad. p. u. u. never changes old, originally 
non-optimal acts into optimal ones and, in addition, either 


(i) All the old, originally optimal acts remain optimal, 


or 
(ii) None of the old, originally optimal acts remain optimal. 


The all-or-none feature of axiom 7’”” may seem a bit too stringent, but 
one can offer this rationalization for it. Suppose that the merit of each 
act can be summarized by a single numerical index which is independent 
of the other acts available. Then the optimal set of the original problem 
is composed of all the acts with the highest index. Now, among the new 
acts either there is one with a higher index, which therefore annihilates 
all the old optimal acts, or there is not and the original optimal set is left 
Intact. A severe criticism of axiom 7” is that it yields unreasonable 
results when it is coupled with either of the more palatable axioms 5 and 6. 
Take, for example, the following d. p. u. u.: 


Si So S53" Sa 
As|O« 4.\2. 2 
Ag|4 0 °0 44. 
z is reasonable that some criterion should allow both A; and 42 in the 
Pumal set. Now add an A3 whose utilities are 
A3s[4 0 0.1 4] 


ee is weakly dominated by 43, axiom 5 implies that act Az cannot 
ns oe Optimal. But one may very well want also to keep Ai as optimal, 
‘ ation of 7’. The rationalization of axiom 7’’ (namely, that each 

fan be fully appraised by a single index) is apparently not suitable. 
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Axiom 7’, An act A! is optimal only if it is optimal in the paired comparison 


hetween A’ and A, for all A in Q. 7” | 
This axiom enables us to transform the decision problem into a series of 


paired 


comparisons between acts and to eliminate those acts which are not optimal in 


any one of these comparisons. We will not, however, use this condition. 


Axiom 7 and its different versions are somewhat controversial. 
of these rules out the minimax risk or regret principle. We are 
sympathetic to axioms 7 and 7’! The others, 7’ and 7”, are sli 
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harder to see through (i.e., they are a little less intuitive), so let us suspend 


judgment until some of their consequences are stated. 


The next axiom is due to Rubin. To suggest it, suppose 4 decision 


maker is given two decision problems having the same sets of aval 
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acts and states but differing in payoffs. Suppose the second problem is 
trivial in the sense that the payoff depends only upon the state and no! 
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pon the act adopted. In other words, in the array representing problem 
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basic desideratum. 
4, Consider the following problems: 


Problem 1 Problem 2 Problem 3 
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where, it will be noted, problem 3 is a mixture of the other two in which 
each is played with probability 19. Intuitively, a plausible method for 
analyzing these d. p. u. u.’s is to be somewhat pessimistic and to behave 
as if the less desirable state is somewhat more likely to arise. ‘The extreme 
example of this rule is the maximiner who focuses entirely on the unde- 
sirable state, but our point holds equally well for one who emphasizes the 
undesirable state only slightly. In problem 1, sj is less desirable, and so 
one is led to choose A;. In problem 3, se is less desirable, and so one 
might be led to choose Ay. But if one subscribes to axiom 8, the same 
alternative must be chosen in both cases, and so we are led to doubt the 
axiom. 

iii. Axiom 8, when added to axiom 3 (i.e., the choice set is invariant 
under labeling of acts) and to axiom 7 (i.e., the addition of acts cannot 
make anon-optimal act optimal), both of which are extremely reasonable, 
Yields the following result: If an optimal act of a given d. p. u. u. is equivalent 


x probability mixture of two other acts, then each of these acts is also optimal. 
or example, in the d. p. u. u. 


$1 SQ 
Ai 0... 2 
Apit . 0 
Asl 1]; 


if Ay 18 optimal, so are A; and Ag, since A3 is equivalent to (4441, 4A2). 
a implies the result that one need never resort to randomized acts in 
Bebena of decision problem. Since, it is contended, this consequence 
ik , one should discard the weakest link in the argument leading to 
herefore, axiom 8 should go. 


) ; f : 
W; to argue against these arguments point by point: 
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pein’ axiom is not only intuitively meaningful but it seems per- 
reasonable. This is a matter of taste! 
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se either A; or Ag; however, 
that they could easily be persuaded to a e E es et bs 
once they become aware of the axiom they will find it accept 
use it to decide upon A; in that problem. geen 
iii. Is the assertion that one need never resort to peonuze ‘ ré i 
inad.p.u.u.so absurd? Maybe not, for one can cite many rage 
criteria which lead to an optimal non-randomized act for any d. P. ‘i 
Furthermore, there are arguments against randomization, for oa 
part of the discussion found in section 4.10, where we examined the HS 
tional interpretation of randomized strategies and cast some doubt » ; 
their applicability, can be taken over almost verbatim. Finally, pe 
(1954, p. 438] argues as follows “It would seem that the need for ey 
domization depends on the statistician’s need to oversimplity the Hi 
ment of his problem because with limited computational ability he 7 
not take full advantage of the actual relationships involved. Genera" ; 
the simplification has the effect of combining states of nature which are equiva 
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lent when random samples are insisted upon.’ [Italics ours.] This discus 
leads naturally to the next axiom. 
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‘ modification of the maximin criteri 
sa were unwilling to endorse its pt 
would continue, axiom 9 is not as innocu 
consequence of some other more basic a 
much, but it does not seem to him to warran 
Suppose A1 and A» are both optimal acts. 
as (1441; 16 Ao) will, operationally, result in a selection é 
optimal acts. Nonetheless, the mixture may evoke a_psycholo; cal 
response in its own right, and, before it is known which optimal act is 
adopted, there is no compelling reason why the anticipation of the mixture 
must be as good as either A; or Ag. For example, an optimist might like 
both A; and A» because in each case he can look forward to very desirable 
returns if certain states obtain; however, with the randomization all 
expected returns will be mitigated, and so the anticipation is not nearly so 
pleasant. Of course, the counterargument is that the apparent reason- 
ableness of the axiom simply demonstrates the irrationality of the opti- 
mist’s wishful thinking. So the battle is joined. The present authors 
are very partial to the axiom and believe the argument against it is rather 
weak, 

So far we have not tried to characterize the notion of “complete 
ignorance.” Our purpose in postponing this discussion is obvious: 
Axioms 1 through 9 are pertinent to decision making where one is not 
“completely ignorant” of the true state. It is interesting that, even with- 
out committing ourselves on the notion of “complete ignorance,” accept- 
ance of axioms 1 through 9 serves to eliminate the maximin criterion 
(eliminated by axiom 8), the minimax risk or regret (eliminated by axiom 
7 or any of its variations), and the Hurwicz a-criteria (eliminated by 
axiom 8 and, for « < 1, by axiom 9). Nonetheless, axioms 1 through 9 
are compatible: the criterion based on the principle of insufficient reason, 
or example, satisfies all of them. 

The following theorem is basic: 


- each criterion which resolves all d. p. u. u.’s in such a manner as to satisty 
=. 1, 3, 4, 5, 7’, 8, and 9, there is an appropriate a priort distribution over the 
§ of nature which is independent of any new acts which might be added, such 


bri an act is optimal (according to the criterion) only if it is best against this a 
on distribution. 


digg his theorem does not say that if an act is best against this a priori 
ee then it is optimal according to the criterion. It only says the 
Be é The theorem indicates that, if we are committed to axioms 
>», 7’, 8, and 9, our first step should be to search for a suitable 
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13.4 AXIOMATIC TREATMENT: | a 
“CQMPLETE IGNORANCE 
Now we turn to the question of “complete ignorance.”’ Consider th, 
following: 
Axiom 10. For any d. p. U. Us the optimal set should not depend upon th 


labeling of the states of nature. 


Obviously, if we have reason to suspect that a given state of nature js 
quite likely the true state whereas another state is quite likely not the true 
state, then in any abstraction of the problem we wish to distinguish between 
these two states. Or, if we number the states of nature in a given problem 
in such a manner that the lower the number the more likely we feel that it 
is the “true” state, then certainly we want to keep the labeling of the 
states in mind and axiom 10 would not be at all appropriate. Loosely 
speaking, whenever axiom 10 is not appropriate, we are not in the realm 
of “complete ignorance.” 

There is a tendency to read too much into this axiom. Some hold that 
adopting axiom 10 is essentially equivalent to assuming that each state i 
equally likely. Although this is true when a suitable collection of the 
i eee is added to 10 (see below), it is not true for 10 alone, 0 for 
secant beter nm. For xp von” 
axiom 10 has the ‘tle f . . goal * fe paired comparison) the 
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wi and only if,” 1.¢., if an act has the highest average utility, then it is indeed 
jimall To round out the picture, the same result hoids if for axiom 7 
‘tutes axioms 6 and 7. (Note that 7’ implies 


an be shown that the “‘only if” assertion can be strengthened 1 


when it is bolstered by 4 and 5 it also implies 6. ) 
acterize the criterion based on the principle of insufficient reason, 
it is the unique criterion which satisfies them. This result is due to 
Chernoff [1954]. 

The maximiners and minimaxers, however, argue that, although axiom 
10 is all right, it does not go far enough in characterizing the notion of 
“complete ignorance.” For example, consider the two d. p. u. u.’s 


noe, U,1 DP Wil) <2 
Sie SOO seeo 4 Si S2 
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aero) .5 5. 5 Aig. 1:0}: Su) 
According to the criterion based on the principle of insufficient reason» 
Ao is optimal for d. p. u. u. 1 and A; for d. p. u. u. 2. But if one is truly 
completely ignorant about the true state in each problem aren’t these prob- 
lems identical? Ind. p. u. u. 1, 52, 53, and s4 can be strategically lumped 
into one state—call it s*. True, s* is “not less likely” to be true than 
either 52, 53, or s4, but if we are completely ignorant we cannot say any- 
thing about s; versus s*. The principle of insufficient reason interprets 
complete ignorance as “‘each state being equally likely,” so s* must be 
treated as if it were “three times as likely” as s1, and, therefore, this 
criterion chooses Ap. But, in considering s* as more likely than s,, one 
admits that he is not completely ignorant. According to some, the very 
€ssence of complete ignorance is to treat d. p. u. u.’s 1 and 2 as equivalent. 
They would add that one is almost never in a state of complete igno- 


Tance, but they would insist that, if one wants to list reasonable desiderata 


ies criteria which purport to handle this case, the following axiom is 
Indispensable. 


on ll. If ad. p. u. u. is modified by deleting a repetitious column (i.é., 
4 lapsing two states which yield identical payoffs for all acts into one), then the 
Plimal set is not altered. 


Axiom 11 can be strengthened to: 


; Axiom WW’. Ifad. p. u. u. is modified by deleting a column which ts equivalent 
pr obability mixture of other columns, then the optimal set 1s not altered. 


> 
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feels strongly about the criterion asec ~ ‘ il 

If one fee d also wants to endorse axiom 11, t! tte 
— esi ri ‘ Tn any d. p. u. U- delete all rep ‘le — 
into this criter1on- u. choose those acts havi es ie 


this modified d. p. u. 
payoff (equal weights). 


This criterion fails to sai 7 oka 
For example, ! 


i iations consider the followin 
its var ; 


ie 52 | «S3 
merit 2600CCOO 
moee 10 10 
aD 


If the choice is confined to A; or A, Ai is optimal (since by axiom 11 5; 
is deleted). If Az is added to A; and Az, then s3 cannot be deleted, and 
according to this criterion A» is changed from non-optimal to optimal 
whereas A; is changed from optimal to non-optimal. Thus, any variant 
of axiom 7 is contradicted. 

Axioms 10 and 11 together are said to characterize “complete igno- 
rance.” Although axioms 10 and 11 are compatible, and axioms 1 to’ 
are compatible, all eleven obviously are not. Something will have to be 
deleted, and one possible candidate is Rubin’s axiom 8—which amoun's 
a ea ee of a constant to a column has no effect on tie 
city ; ps ee ts, modified to the extent of et 
through 6, plus an a Sore applying the criteria, peg en ) 
a Rien of 7, plus 10 and 11. The maximin (utili 

sree: [19 53], “wey same way, satisfies these and axiom 9 in een 
the following coat if ae @ ie due to Hurwicz [1951 a], has Otis 
it takes into account ea. Busts axioms 1, 3, 4, 7”, 10; . , 
with each act. 1, only the minimum and maximum utility eet 
are to be used to te a Particular way these maxima and mini 
of axioms, For he act as best is left unresolved by the a 
this axiom set. Another a the Hurwicz a-criteria are compatible ie 
only if either its minim ce Compatible criterion is: An act is optimal if - 
When there are ties for are jot than the minimum of any other “ae 
among those acts aM oe largest minimum, it has the largest maxim" 

Suppose that we let € largest minimum. ith 2 
ae . Baer the minimum utility associated ie 0, 

UX of the ao if we accept this axiom set (1, 3, % is 
and (magn 18 to decide upon an ordering betwee? the 
f we also demand that axiom 2 be mee a 
ormation, Thu ordering when we change the utilin® (a 

8, if the criterion selects (m’, M’) <a 


ie 


The Axioms Referriz 
13.4] 


yf’) then it must also select (am’ + 
: here a > 9: In this connection, th: 
W: 


d. p. u. U. 


by 


there exists a number a such that we would say A, is optimal for all 
x <1-— a, and A¢ is optimal for all x > 1 — a, and if we demand that a 
criterion yielding this decision also satisfy axioms 1, 2, 3, 4, 7”, 10, and 11, 


then it must be Hurwicz’s with index a. 

The approach just used, which will be employed again in the next sec- 
tion, warrants a comment. We first commit ourselves to a class of 
axioms, thereby restricting the class of potential criteria. Second, we 
consider a simple class of d. p. u. u.’s for which we feel able to make sub- 
jective commitments as to the optimal sets. If our choice of axioms and 
special cases is clever, then by using the axioms we can logically extend 
the consistent decisions given for a simple class of d. p. u. u.’s to a precise 
formula which resolves all d. p. u. u.’s. 


> Milnor [1954] states a set of requirements for reasonable decision criteria, 
where the criteria do not select an optimal set of acts but yield a complete (transi- 
tive) ordering for all acts. The analysis is much simpler in these terms. We out- 
line his work here with a minimum of comments. In parenthesis after each axiom 
We give the nearest corresponding statement in terms of optimal sets. 


1. Ordering. All acts must be completely ordered. (1.) 
‘ : if The ordering is independent of labeling of rows and columns. (3 
n . 


ce <n domination. Act A’ is preferred to A” if A’ strongly dominates A”. 
and 5, 

4. Continuity. If A’ is preferred to A’ in a sequence of d. p. u. u.’s, then A” is not 
Preferred to A’ in the limit d. p. u. u. [A sequence of d. p. u. u.’s converge to a 
ace d. p. u. u. if the utility numbers for each (act, state) pair converge to the 

ity number of the (act, state) pair of the limit d. p. u. u.] (No correlate.) 
: Linearity, The ordering is not changed by linear utility transformations. (2.) 
- Row adjunction. The ordering between old rows is not changed by adding a new 
Tow, (7, Pe Tae mtr.) 
(3 : Column linearity. The ordering is not changed by adding a constant to a column. 


= aon duplication. Adding an identical column does not change the order- 


is br Convexity, If A’ and A" are indifferent in the ordering, then neither A’ nor A’” 
16 eed te O44, Moa". (9.) 
Orde; peal row adjunction. Adding a weakly dominated act does not change the 
ng of old acts. (6.) 
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summarizes hi 


Milnor Laplace Wald du 
Axiom ) 
® @ g 
1. Ordering @ ® 9 
2, Symmetry 
3, Str. dom. 1 ) : 
4. Continuity : ‘ 2 
5. Linearity @ 
6. Row adj. @ a 
7. Col. lin. @ S 
8. Col. dup. @ © @ 
9. Convexity - @ + @ 
10. Sp. row adj. x .. ‘a @ 


In this tabulation Laplace refers to the criterion based on the principle of insutl 
cient reason, Wald to the maximin utility criterion, Hurwicz to the a kad 
pessimism criteria, and Savage to the minimax risk or regret criterion. ‘a 
means the criterion and the axiom are compatible. Each criterion 1s charac 
terized by the axioms marked @). 


Note that, unlike Chernoff’s characterization of the Laplace criterion, Milnor’ 
does not require the convexity axiom. This discrepancy seems strange until : ; 
recalled that Milnor’s axioms 1 and 6 are stronger then their correlates 9 C e 
noff’s system. Milnor demands a complete ordering, not just an optimal set, and 


his sixth axiom corresponds to axiom 7” (cf. p. 289) which is stronger than axiom! 
used by Chernoff, 


_ Another point of di 
tinuity. All four of t 
weak domination (ie., 
tion. 


, : con: 
Screpancy is Milnor’s use of strong domination one 
he criteria satisfy these conditions, but they WC 


i domin@ 
Se ORNS, p. 287 loyed instead of strong 
To see this, consider sie 4. p. s “a — 
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43.5 THE CASE OF “‘PARTIAL IGNORAN 


4 common criticism of such criteria as the maxin 
Hurwicz a, and that based on the principle of insufficient reason 


regret, : ; : 

i; that they are rationalized on some notion of complete ignorance. 4n 
practice, however, the decision maker usually has some vague partial 
information concerning the true state. No matter how vagu it is, he may 
not wish to endorse any characterization of complete ignorance (€.g., 


axiom 10 or 11), and so the heart is cut out of criteria based on this notion. 
The present section is devoted to suggestions for coping with this hiatus 
between complete ignorance and risk. 

As background for this discussion, consider a contestant on the famous 
$64,000 quiz show who has just answered the $32,000 question correctly. 
His problem is whether to choose act Aj, to try for $64,000, or to choose 
act Ao, to stop at $32,000. His d. p. u. u. takes the form: 


The $64,000 question is one that the contestant 


5, = could answer sz = could not answer 
Obtain $64,000 (tax-| Obtain a consolation 
able) plus _ prestige, prize of a Cadillac, 
A = try for $64,000 | publicity, etc. plus knowledge that 
$32,000 (taxable) was 
lost 
Obtain $32,000 (tax-|Same as (Ag, 51) pair 
ea able), get less prestige 
_P and publicity than for 


the (Aj, 51) pair 


We assume that in utility terms the problem reduces to the form: 


Sj S2 


Ayito 
Ay|x X*4- 


os Us suppose, further that no other contestant has ever tried for the 
question. For all our contestant knows, the difficulty of the 
on can run the gamut from the impossible to ‘“What was the color of 
om white horse?” Everything hinges on his appraisal of the 
i ee litics of s; and sy. He might take the point of view that he 
tosia etely ignorant of the true state, but it is much more likely that he 
take into consideration such intangibles as: (a) the public reaction 
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ally processed. 


Suppose, after due deliberation, the oe Ay Wee 
pp ; that he behaved as if it were meaningi 18D aN a prin 
ee of x or greater.° Conversely, one 1 ed to say that 
ee mnie i i. probability” of 51 is x or greater, then A; should be 
al i. th oe of ideas which will be partially formulated nov, 
ee rete re he school led by Savage |1954), which holds 
We shall first report on the sc 


the view that by processing one’s partial information (as evidenced by 
one’s responses to a series of simple hypothetical questions of the Yes-No 
variety) one can generate an @ priori probability distribution over the 
states of nature which is appropriate for making decisions. This reduces 
the decision problem from one of uncertainty to one of risk. ‘Thea prion 
distribution obtained in this manner is called a subjective probability 
distribution. | 

Savage, in his The Foundations of Statistics, ‘develops, explains, and 
defends a certain abstract theory of behavior of a highly idealized person 
faced with uncertainty.” The theory is based on a synthesis of the works 
of Bruno de Finetti on a personalistic view of probability and of the modern 
theory of utility due to von Neumann and Morgenstern. Since Savas? 
expounds his position with vigor and clarity, we shall merely attempt to 
Capture what, to our minds, is the most salient contribution of his school 
Furthermore, we shall not follow Savage’s development of the subject 
a ae ae new concepts onto the development given the 


L ae & ; 
es é 51, 52, feist bea labeling of the possible states of nature for S 
eee decision Problem. Each of these labels refers to specific a 
Phenomena and we (in the role of a decision maker) might feel - 
re plausible than others. Suppose, furthermor® fe 
j are convi ; pen {a 
‘Ng Problems of thi Avinced that we want to be consistent 


ome 


‘ deo” 
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h satisfies these axioms must 
are best against some specific a 


since ad 
Thus, it is reasonable to assert that if there exist: f 

‘ priori probability distribution over the states, then this distribution 
depends solely upon our state of information concerning 51, 52, * °° , Sn 
The strategy now is to consider a series of simple hypothetical d. p. u. u.’s 
with these states of nature, to resolve them according to our best intuitive 
judgement, and then to use these commitments to infer a plausible a priort 
distribution. 

Let us illustrate the procedure by a case which involves three specific 
states 51, 52, and s3. In order to generate an “appropriate” a priori dis- 
tribution over these states let us introduce two hypothetical acts, Ai and 
A, such that their consequences for the various states have the following 
monetary equivalences: 


S1 S2 S3 
Ai i $0 fog) 
Ao| $y $y $y |. 


Adjust act A, i.e., y, until we are indifferent between Ai and Az. Sup- 
pose the point of indifference (which is assumed to exist) is at $65. Sup- 
pose, further, that we are indifferent between obtaining $65 for certain 
and getting $100 with an objective probability of 0.8 and $0 with an 
objective probability of 0.2. Hence the utilities of $0, $65, and $100 
can be taken as 0, 0.8, and 1. In utility payoffs we have 


Sy 52 53 

Yaghe. Or aOiw Lot 

Ag ee OnB: “OF8"]: 
Now, indifference between A, and A» is compatible with an a priori dis- 
tribution only if the a priori probability of 53 is 0.8. If we have no prefer- 
€nees about the states themselves then, as a check and possible short cut, 
we could ask ourselves: “If we were given the alternative (a) of obtaining 
4 prize of x dollars if s3 turns out to be true and nothing if s1 or s2 were 
a Versus the alternative (b) of obtaining a prize of x dollars with objec- 
Probability and nothing with objective probability 1 — p, for what 
aa we be indifferent??? To check, we would require that indifference 
Me atp = 0.8 independent of the value of x, so long as it is positive! In 


} 2 8im} eye - 
. ag manner, we could force ourselves to accept a probability assign- 
Nt for 5 and for s1. In practice, however, one’s choices for a series of 
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ume ; econo 
ae tent. Let us assume that this jockeying he 
consistent. modifying them, — 


r consistency, 
ds ultimately to a bona fide a p7 tion. No 
tisfy the a dab 


checking on thei 


j —lea 
sistency, etc. J nate 
if we wish our decision criterion both to sa 


d results that agree with our by now consisten sreferences fh 


to yiel oe 
simple hypothetical problems, then we are committed \ criterion whici 
selects as optimal only acts which are best against th priori distribution 


P To describe precisely what Savage means by a consistent set of preferences 
we must outline briefly his postulates for a personalistic theory of decision. The 
assumed ingredients of the decision problem are: 


i. The set of states of the world—a set S with (an infinite number of) elements 
s,s’, + + and with subsets £, E’, - - + called events. 

ii. The set of consequences—a set C with elements ¢, eee * 

iii. The set of acts—a set @ with elements A, A’, °° - 

iv. An assignment to each act-state pair (A, s) of a consequence from C which 
is denoted by A(s). 

v. A binary relation > between pairs of acts which is interpreted to mean ““s 
preferred or indifferent to.” 


Savage then postulates and defines the following: 


P . . . } 
ostulate 1. The relation = is a weak ordering of the acts, t.¢., every pair a ee 
comparable and the relation is transitive. 


Definiti : | 
are Bid. ae aes “A = A’ given E” means that, if acts A and 4 
0 that their consequences are the same for every state not included 


in the event E. r clud 
, but if they are not changed for the states in E, then the modification 


of A is preferred or indi 
. d 
Stina hetohge to the modification of A’. 


Fe titton ts ace ell defined unless the preference relation betwee? 0 
ened xo to depend upon the particular agreement selectec 
Postul Xt postulate makes this assumption indirectly. 

0 ti 
stulate 2. Conditional preference 
> 


soi 
| for 


Definition, | as defined above, is well defined. 
i on, fA = ce ! 
if and only if 4 > ae) ¢and A’(s) = ¢! for See in ,,then we define 62 

€ given A and 4’ ' ) | 
Consequences a. A’ of this definition are called “constant” acts sin’ . 


re inde e : 
on set of C  neaanes which state holds. The relation 2 '§ exten 4 
Det it for each seed identifying each consequence with the constant ’ 
Ifion. An : 
event ¢ j ef 
2 Le., for every A and ae riled null if every pair of acts are indifferen" giv’ 
> 


| deere 
IPR; © A’ given $ and A’ > A given ¢. 
E is a non-null even 


E if and only ife > ai 


Postulate Cy 


then A > 4! ,- ui 
5 aed t and A(s) = ¢ and A'(s) = ¢ J” all 5" 
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This asserts that conditional preferences d 

Definition. The event E is said to be 


whenever 
(i) ¢ and c’ are any two consequences such that 


(ii) A(s) = ¢ for s in E and A(s) = c’ for s not in E, 


and 
2 Ui te , 
(ii) Als) = ¢ for sin E’ and A"(s) = ¢’ for s not in E’, 
then.A’ = A. 
Postulate 4. Probabilitywise, any two events are comparable. 
Postulate 5. ° There is at least one pair of acts which are not indifferent. 
ostulate 6. Suppose A > A’. For each consequence c, no matter how desirable or 
. . a q as 
undesirable it may be, there exists a sufficiently fine partitioning of S into a finite number of 
gents such that if either A or A’ is modified to yield c for any single event of the partition 
the preference for A over A’ is not changed. 


Postulate 7. Let A’ be an act and let A,’ be the constant act which agrees with A’ for 
the states. Then, 


(i) A> Ay,’ given E for all s in E implies A > A’ given E, 


and 
(ii) A,’ > A given E for all s in E implies A’ >= A given E. 


From these seven postulates Savage is able to show (among other things) the 
following two theorems. 


Theorem. There exists a unique real-valued function P defined for the set of events 
(subsets of S) such that 


() P(E)>0 forall E, 
(ii) Ps) = 4, 


(ili) If Band E’ are disjoint, then P(/EWE’) = P(E) + P(E’), 
and 


(iv) Bis not more probable than E’ if and only if P(E) < P(E’). 


a is called the personalistic probability measure reflecting the individual’s reported 
Mgs as to which of a pair of events is more likely to occur. 


ha teorem. There exists a real-valued function u defined over the set of consequences 
i a the following property: If E;, where i = 1,2, °° * 5m, tS @ partition of S and A 
tition act with consequence c; on Ey, and if Ej’, where i = 7, 2, °° ©, Hess ao another par- 

of S and A’ is an act with consequence c; on E;’, then A = A’ if and only if 


™m 


y u(oe)P(E) 2 bY u(ci') P(E). 
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utilities, and it is established that it can be done in suc! iy that expectey 


utilities correctly reflect preferences. Thus, Savage's contribution, 
major one in the foundations of decision making—s a synthesis of the von 
Neumann-Morgenstern utility approach to decision making and de 


Finetti’s calculus of subjective probability. 

To transform vague information concerning the states of nature into an 
explicit a priori probability distribution, the decision maker has had to 
register consistent choices in a series of simple hypothetical problems 
involving these states. No one claims that this is an easy task, but some 
go so far as to assert that in some contexts even these preliminary choices 
are too difficult to make with any confidence. They hold, further, that, 
if consistent responses are forced, the results are not very reliable and to 
build upon them is a mistake. They feel, introspectively, that, if one 
could instantaneously wipe out the memory of one’s past choices and if 
the process for obtaining a subjective a priori distribution were immediately 


repeated, the new a priori distribution could easily be quite different from 
the old one. 


There are two su 
Hodges and Lehman 


in X (the set of all randomized acts); let § 


n S (nature’s state set); let y denote an 4 pr 

A r S; and let Y be the set of all a prior! oe 
edge can be utilis 4 S we have seen, Savage suggests that partial knowl 
ed to find a unique a priori distribution ys 2% ™ 
ee an A which is best against y”. Hurwicz 9° 


n: h 
© Suggests that partial ignorance over 
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terion associate to each act x the a-index 
b] 
am, + (1 — a)A 

where Mx and M, are, respectively, the minimum and maximum payoff: 
which result from x as the a priori distribution y runs over its domain Y 
Choose an act which yields the highest a-index. 

The spirit of Hurwicz’s proposal is quite clear, and there are contexts? 


where we feel his specific proposal can be employed. In general, how- 
ever, we feel that his suggestions are too vague to resolve the problem. 
Operationally, how does one characterize the elements of Y‘?? Even if all 
“reasonable” y are included in Y“”, can’t some y’s be “‘more reasonable” 
than others? Maybe one could capture this differential plausibility for y’s 
in Y by an a priori distribution on Y. But why stop there? There is 
a next level, and a next, etc. Of course, expedient compromises can be 
made, and Hurwicz’s original hope still has merit: that from a lot of 
special decisions about Y , one will come closer to extracting faithfully 
one’s partial information about the states than by a forced choice of an 
a priori distribution. 

Independently of Hurwicz, Good [1950] has offered much the same 
suggestion for processing information; however, he subsquently used the 
maximin criterion rather the a-criteria. 

Hodges and Lehmann [1952] also take the position that, in practice, 
information about states of nature often lies somewhere between complete 
'snorance and a precise specification of an a priori distribution. For 
‘ample, an a priori distribution y‘” might seem likely and yet not be 
sufficiently reliable to base decisions on. An act which is best against 
Y°’ might involve a large risk if some state actually turns out to be true. 

ote that Hodges and Lehmann, like most statisticians, phrase their 
Tesults in terms of risk payoffs rather than in utility payoffs.) So they 
Propose that: (a) An act (maybe randomized) be found which minimizes 

© maximum risk; let its maximum risk be C. (b) On the basis of the 
ay C and the context of the problem, choose a quantity Co, greater 
os: to serve as the maximum tolerable risk. (c) Choose an act x 

is best against y(” subject to the condition that the act has a maximum 
ae is some question here of the existence of the minimum nd seams how- 
te oe mathematical point of view, this can be taken care of easily. ; 
WO states of nature be whether a subject does or does not have tuberculosis, 
to aot that from medical statistics the proportion of people having T.B. eowu 
ee 2 Seb icct is selfeclected, we may ae 
eater th ity of T.B. is x; but we may find it acceptable to say ything 


5 an or equal to x, and, conceivably, we might behave as if we were completely 
“Rtas to which value it has in this interval. 
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ideas can be app 


flict, ie., where the adversary is not neutral but a true adversary, |, 

y beUsy = > let 41< i ee 

two-person non-zero-sum, non-cooperative game, !et refer to player 
»» and to 2 as “the adversai y.’ faa dese 


as “the decision maker 
maker wishes to choose an “optima 00 
(acts) available to him. One modus operandi for the decision maker is to 
generate an a priort probability distribution over the states (pure strategies 
of his adversary by taking into account both the strategic aspects of the 
game and what “psychological” information is known about his adversary, 
and to choose an act which is best against this a priori distribution. To 
determine such a subjective a priori distribution, the decision maker might 
imagine a series of simple hypothetical side bets whose payoffs depend 
upon the strategy his adversary employs. ‘This is easier said than done, 
however, since the decision maker cannot ignore the possibility that his 
adversary will attempt to hypothecate such a procedure for him and will 
adjust his choice of strategy accordingly. In other words, the decision 
maker’s very selection of an a priori distribution for his adversary sets UP 
indirect forces to alter this initial choice. If such is the case, one a” 
ee ae Sep ing po 
change—until oe. = pete pee back = panes puoducs eo 
De that ron ss - equilibrium in the decision maker’s ay 
played. If in és an ap aged ones seatratesy ar pei 
Seo mihat his <a. ation the theory is clear cut and if a decis | 

ersary will comply with the theory, then, ° 


sense, the theory d : 
efines the decisi : ees 
: 3 eerer distr! 
tion for his adversary. aker’s choice of an a priori 


Of course, a decision ma 
appraisals of his adversary 


way. For exam 
ple, h 
and Lehmann [1952] = ald use the compromise suggested by 


|’ set from the set of possible strategie; 
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“The formulations given here may be app icable also to games pilayea 
against 20 opponent rather than against Nature. This would be the 

Gn the two-person zero-sum game) if one believed from past 


that the opponent is likely to make ce 
take advantage of these and still protect oneself in case the opponent has 


improved.” Or, the decision maker migt t use the 1UrI WICZ DI opos al and 


maximin, or he might use an a-index over some suitably chosen restricted 
class of a priori distributions. 


> It is interesting to reconsider the appropriateness of the axioms for a reasonable 
criterion when nature is replaced by an intelligent adversary. Axioms 1 through 
5 seem equally acceptable in this interpretation. Axioms 6 and all versions of 
the independence of irrelevant alternatives—are open to the obvious criticism 
that adding a new act for the decision maker can affect the strategic position of the 
adversary and therefore the decision maker should reappraise the relative merits 
of the old acts. The minimax risk criterion of Savage, which was mainly criti- 
cized on the basis of its non-independence of irrelevant alternatives, should there- 
fore be re-evaluated. Rubin’s axiom 8 must be modified slightly in order for it 
to make sense in this context. Recall that game 1 is played with probability 
p and game 2 is played with probability 1 — p. Assume that in game 2 the 
payoffs to player 1 are constant within any column and that the payoffs to player 2 
are constant within any row (remember the game is non-zero-sum). In this case, 
the modified axiom asserts that the decision maker should behave in the same way 
both in the mixture of the two games and in game 1. This modified axiom seems 
just as reasonable in this context as the original did in its context. Axiom 9 
(convexity of the optimal set) is just as reasonable as before. Axiom 10 (the 
optimal set for the decision maker should not depend upon the labeling of states 
for his adversary) seems more universally applicable in the conflict context than 
it did in the original context. As originally proposed, this axiom was designed to 
capture the notion of complete ignorance, but no such interpretation need be 
implied by its use in the present context. Axiom 11 needs to be slightly modified: 
If two columns have identical payoffs for both the decision maker and his adver- 
sary, then one column can be deleted without changing the decision maker’s 
optimal set. Axiom 11’ is modified in an analogous manner. The modified 
axioms 11 and 11’ seem quite reasonable. 
Certain weak implications follow from these axioms. For example, the modified 
axiom 8 rules out the Hodges-Lehmann proposal, the maximin (utility) criterion 
‘ven modified for admissibility), and the Hurwicz a-criterion (even when applied 
‘0 restricted subsets of a priori distributions for the adversary). We cannot con- 
clude from any subset of these axioms, however, that all optimal acts for a specific 
Same must be best against some specific a priort distribution for the adversary. 
a 4 two-person game-like situation where each player knows his own payoff, 
et about his opponent’s payoff, the axioms designed for decision making 
eo can be interpreted directly: they are all meaningful—but not 
tee uy reasonable. In essence, a player can peat his opponent 's_ pure 
a as states, and his opponent's choice as the ‘true state.” Certain two- 
SBE oem, non-cooperative games—especially with imperfections of 
€dge—are close to this ideal type. < 
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estimate an unknown parameter (e.g., the mean of a normal distribution 


and the payoff is +1 to the statistician whose guess comes closest to the 
true parameter and —1 to the other statistician. 
considers only the case where an a priori distribution over the states (1.¢, 
the set of parameter values) is given, or where such a distribution can be 


partially inferred from past problems. 


Robbins, however, 
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Many conflict situations, not bon 
knowledge is limited (e.g., with respect to 


etc.), can be considered formally 
mieia . . . 7 
games however, are quite difficult to 
p] 
necessary to specify each player’s a prior 


nature. 


13.7 STATISTICAL DECISION MAKING—FIXED EXPERIMENTATION 


Classical statistical inference is usually compartmentalized into two 
categories: (a) the theory of testing hypotheses, and (6) the theory of esti- 
mation. The theory of confidence estimation is then introduced as a 
conceptual generalization of the theory of (point) estimation, but in tech- 
nical detail it is more intimately connected with the theory of testing 
hypotheses. For our purposes it will be easier to categorize inference 
problems according to: (a) the number of states of nature, (e.g., exactly 
two states, a finite number of states, a continuum of states), (6) the number 
of pure terminal acts available, and (c) the type of experimental evidence 
which is available or can be obtained. 

In each case, our strategy will be to reduce the statistical decision prob- 
lem to one of decision making under uncertainty. We will adopt the 
formulation of the statistical decision problem due to Wald [1950 a], and 
oe will show where the more classical formulations fit into the overall 
picture. 

Illustration. An example of a two-state, two-act problem where the 
type of experimentation is fixed. 

’ The diagnostic problem of deciding whether or not a particular patient 
is tubercular can be systematized as follows: 


State of Nature 


5, = Patient is sq = Patient is 
Tubercular not Tubercular 
A; = Assert Patient is [Classify tubercular Méisclassify a non- 
Act Tubercular correctly tubercular 
A» = Assert Patient is | Misclassify a tuber- Classify non-tuber- 
not Tubercular cular cular correctly 


ame to help decide which act to choose, an experiment & is performed 
saa subject (e.g., an X-ray, a sputum test, a guinea plg fest, or some 

Imation of these). Let 01, O2, °° * ,Or be the set of possible out- 
Comes? of experiment & A decision rule (or strategy) is an overall pre- 


9 
The set of outcomes {0}, O2, - - - , Or} is called the sample space of &. 


Making under Uncertainty 
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associates A; to the outcomes which are ‘“‘most likely ‘cur when s, 
° as =} oe — , 

is true and A, to outcomes which are “most likely” to occur when 59 is true. 

To formalize this, suppose that as part of the giv ens of the problem we 

are told the probability of each outcome when 5} is true and when », js 


true. To evaluate the decision rule D;, we first compute its performance 


under s; and then under 52. 
Let 


P(A | D;) = the probability that, when sj is true, experiment é 
results in an outcome for which D; associates act A) 
(i.e., the probability of D; resulting in a misclassifica- 
tion of an s; patient). 

the probability that, when s2 is true, experiment 6 
results in an outcome for which D, associates act 41 
(i.e., the probability of D; resulting in a misclassifica- 
tion of an s9 patient). 


P2(Aj | Dj) 


fication of a tubercular, wit 


respectively, Simi] 
with “prizes? arly, 


(Ao, 59), the co 


h probabilities 1. — P,(Ag | D,) and Py(A2 | Di)s 
the consequence of a (Dj, s2) pair is a lott’ 
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13. 
by 4 (linear) utility function, and let the utilities of the four basic conse- 
quences be 
S$} 52 

Ay {ui 19 | 

Ag | u21 u22 |, 
where an arbitrary but fixed choice of origin and unit has been made. 


Thus, the utility for the consequence 


i. (Di, 51) is will — Pi(Ag| D,)] + woiPi(A2| D;) = u(Dj, 51), say. 
ii. (Dj, 52) is u1eP2(A1 | Dj) + uooll — Po(A1| D,)] = u(D,, 52), say. 


With these assumptions and notations, our original problem shapes up 
as follows: To choose among the acts D;, Do, - - - , Dor (these acts in this 
modified problem are really decision rules for the statistical decision prob- 
lem), given the two states of nature s; and s2 and the utility payoff array: 


S] S92 
imu), 51) u(D4, 52) 
Do u(De, 51) u(Do, 52) 


(utility payoff). 


Der |u(Der, 51) u(Dar, so) 


In this formulation, the problem is nothing but a decision problem under 
uncertainty (d. p. u. u.), and our previous discussion is directly applicable. 
Now, let us return to the general case where there are n states of nature 
"h 8% °° * , s, and m acts A1, As, * * + , Am. The analysis of the exam- 
Ple can be extended in the obvious way, as we shall see. As part of the 

data of the problem we are given: 
mea eel payoffs u;;,i = 1,2, °°: m, wee DB neal a where 
ON y of the consequence associated with the act-state pair 
i). 
ot &xPetiment , and a probability distribution over the set of possi- 

omes of & for each state of nature. 

| oe tule D assigns to each possible outcome of & a unique act. 12 
| — © consequence of a decision rule D when s; is true, ie, a 
“ ae prizes are the consequences of the act-state pairs (Ai, 53), 
Probabilit ee (Am, 53). The probability of the prize (A;, s;) is the 
Prescribes 4, at, if 5; is true, an outcome of & will occur such that D 
i: Let this probability be denoted by P,(A; | D). Hence, 


Pt 
ther, ; 2 a F 
€ are r possible outcomes, then ther~ will be m” possible decision rules. 
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; i serve? 
In this case, what purpose does performing the _ amiaeee ‘ 
Since the likelihood of an outcome of & depends a es 
seems reasonable that, once the outcome of & is a ie peers 
assignment over the states should be altered. Let © be aN 
and let the conditional probability of s; given © be denote os i ns 
But now that & has been conducted and © observed, we e te Mie 
original problem of the optimal selection of an act when t ep a 
of the states of nature are known—however, these probabilities 


*]: 4 whi n 0 18 
P(s| ©), P(s2| 0), + - - » P(s,| ©). The utility of act A,, whe 
observed, is 


P(s1 | O)uir + P(s9 ee P(Sn | O)uins 
and the act which has the hi 
come © we can associate the 
utility payoffs—the wei 
tion, which associates 
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18 Recall that, in the Savage subjectivist school, an a priori distribution ' 
Meaningful and essentially given. 

“Tt can be shown that 


P(s;| ©) = 


P(O | s;)P(s;) 
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|s2)P(s2) +... +P(O | sn)P(n) 
where P(O | s;) is th 


bability (1 is exP™™ 
* . ie Proba! it ik iF 
Sion is known as B ¥ (likelihoo 


d) of © 4 ih 
) f ’ given that 5; Is true. 
ayes? formula 


Statistical Decision Making—Exper 


13.8] 
described above, is known as the Bayes de 
onl (Plss), Pls2), °°» Plsn)}. 

The following point can be easily verifies 
the Bayes rule maximizes the index 


P(sy)u(D, 51) + P(s2)u(D, se) + ° F P(sn)uWD, Sn 


which is associated to each decision rule D. This result nicely tidies up 


the loose ends of decision making under r7sk in light of additional experi- 
mental evidence. In short, the initial probability distribution over the 
states is changed to the conditional one given by the outcome of the 
experiment, and then one proceeds as in the case of no experimental 
evidence. 


> The final topic of this section can be given the elliptic heading “On the equiva- 
lence of two methods of randomization.” Suppose that D‘), D®, +--+, D” 
are different (non-randomized) decision rules. If experiment & has the outcome 
0, let D® (©) denote the act specified by rule D. A probability mixture over 
decision rules, (p:D, p2D©, - - - , pp-D™), where the p; are non-negative and 
sum to 1, is analogous to a mixed strategy. Operationally, if such a probability 
mixture is chosen and the experiment has the outcome 0, then act D®(O) is 
adopted with probability p;. We observe that, although p; does not depend upon 
0, D(©) of course does. 

Instead of taking mixtures over decision rules, a more general scheme is to 
define for each possible outcome of & a probability mixture over the acts. The 
number of acts used and the probabilities with which they are employed can 
depend upon the outcome. For example, if 0’ occurs we might adopt (4A3, 
As, 44g), whereas if ©’ occurs we might adopt (1442, 1446, ?6A9, 39413). 
Any rule of this type which assigns a mixture of acts to each outcome is called a 
randomized decision rule. The problem is this: Given a randomized decision rule, 
does there always exist an appropriate probability mixture of non-randomized 
Tules (i.e., rules that prescribe a definite act to each outcome) which will yield the 
pau results? Put another way, are we unduly restricting ourselves by first 
Considering non-randomized rules and then allowing probability mixtures over 
these, instead of allowing for randomized rules initially? The answer to the first 
question is Yes; to the second, No. We can exactly match any randomized rule 
ae probability mixture of non-randomized ones provided the set of outcomes 
ae eee and, even if the outcome set of & is infinite, very modest assumptions 
ae probability measures involved are sufficient to show that for each random- 

tule there is an equivalent probability mixture of non-randomized rules— 


cent in the sense that they yield the same utility payoffs for each state of 
re. < 


13 
8 STATISTICAL DECISION MAKING— 
EXPERIMENTATION NOT FIXED 


a8 Consider now the same type of problem as in the preceding section, 
“pt that the experiment is not necessarily prescribed in advance. 
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oo TOR econ Or fancier still, we might wish 
pee por va on ; f observation to be taken at a given 
to make the decision as to the type 2 

he previous history of experimentation. In short, 
cl art ] ll sorts of sequential or 
in the present framework we want to to erate a ss te 
non-sequential designs of experiments, questionnaires, ep - 2 : 
dures, etc. We only require that any decision rule (strategy) whic 
decision maker adopts for experimentation and for eventual termina 
action should be explicit in the sense that it must assert unequivocally, 
prior to any experimentation, exactly what is to be done at each stage as 
a function of the information available at that stage. Thus for each a 
(strategy) one can list, at least conceptually, all the possible outcomes 0 
experimentation and of terminal action. The problem in all its com- 
plexity reduces simply to a choice among decision rules (strategies). 

Let us first evaluate D’s performance when gistrue. Itis assumed a 
each possible outcome that is compatible with D can be given a uit 
index. This utility index will be a composite of two types of consider 
tions: (a) the cost of obtaining the particular outcome (including the oo 
of time, labor, materials, etc.) and (6) the losses due to wrong termina 
decisions. But, conditional upon the knowledge that s; is true, We = 
again (conceptually) compute the likelihood of each outcome which : 
compatible with strategy D. Thus, when 5s; is true, to each D we on 
associated a massive lottery: the Prizes are the consequences associated 4 
(outcome, $j) pairs weighted according to probabilities which are con 


puted on the basis of S;Svalidity. Let u(D, s;) be the utility of this Jottery 
then D is appraised by rs 


6s S] SQ Sn 
: [u(D, 5), UD; s2), +--, u(D, sa). 
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We, the decision maker, are given a coin whose probabilit 

heads or tails is unknown (to us). The coil is to be tossed by a specifi 

mechanism, and the outcome—heads or tails— will be noted by a repute 

ple outsider but not told tous. We have the choice of guessing heads or 


fails and our payoff is: 


A; = Guess Heads | Win $10 Lose $10 
Ag = Guess Tails 


As stated so far, the problem is an ordinary d. p. u. u. Now, let experi- 
mentation be introduced. Prior to making our guess, we are given the 
opportunity to observe this particular mechanism toss the given coin any 
odd number of times at a flat rate of c dollars per toss. Assume we must 
state in advance the number of tosses to be made. We will confine our- 
selves to decision rules that can be summarized by a pair of numbers 
(n, m), where n refers to the (odd) number of observations to be taken and 
where m has the following interpretation: if the number of heads is less 
than m, guess tails; if greater than or equal to m, guess heads. Intuitively, 
for any n, the most reasonable m is n/2, but let us not prejudge the problem. 

Since there is obviously an upper bound for n, the choice problem 
involves a finite number of decision rules. 

Let us make the assumption that repeated tosses are independent and 
that in the long run the ratio of the number of heads to tosses will “‘stabi- 
lize” to some number p, which will be interpreted as the objective proba- 
bility of the specific coin turning up heads when tossed by the given 
mechanism. The number p can take on all values from 0 to 1 inclusive, 
and each value of p will be identified with a possible state of nature. In 
other words, we have a continuum of states. 
ae evaluate the decision rule (n, m) under the assumption that p is true. 

at the utility of a dollars is a units (i.e., the utility of money is linear in 
money). Furthermore, let B(m, n, p) denote probability of getting at least m 


ty in n tosses when the probability of a head at each toss is p. ‘The evaluation 
decision rule (n, m) if p is true is: 


No. of 


Original Toss _ Heads in n Trials Utility of Outcome Probability 
H Less than m —10 — cn pil — Bim, n, p)] 
H At least m 10 — cn pB(m, n, p) 
T Less than m 10 — cn (1 — p)[1 — Bim, 2, p)] 
T At least m —10 — on (1 — p)B(m, n, p). 
, ae the utility of (n, m) when p is true (ie., u[(n, m); p}) one must sum the 
this - ee outcome times its probability over all possible outcomes. When 


and the expression is simplified, we get: 


ul(n, m), p] = —cn + 10(1 — 2p) + B(m, n, p)[20(2p — 1)]. < 


FTG ET 


a 


Individual Decision Making under Unce 


on rule according to the 


316 

The optimal decisi | e.... 
is not to take any observations whatsoever rega ‘ fa 
were as low as 400 observations per penn) a call 
that, regardless of the outcome of eo remain 
the possibility that p = pen in that case kn thal 
If we are completely pess! 


mistic in outlook why eile 
i c 7 robab = 
soever sampling? Just take heads with p i sites 


one is completely optimistic. Then the best p is of weve 
exactly 1 observation, we are sure to determine Ww hick The Hurwie 
a-criterion asserts that at most one observation Is €\ sary, and it 
should be taken only if a, the optimism-pessimism index, is greater than 
90 c. In other words if ¢ is a penny, then one should take 1 observation 


only if a > 0.2. 

These solutions both go counter to intuition, and therefore the reason- 
ableness of the maximin and the Hurwicz a-criteria is further cast into 
doubt. The example illustrates a major criticism of these criteria, namely: 
They focus so strongly on the best and worst states of nature that often they do not 
permit one to gather negative information about the plausibility of such states, n 
matter how slight the cost. 

The minimax risk criterion, on the other hand, does not turn up its 
nose so easily at cheap experimental evidence. Recall, however, that a 
major criticism leveled at the minimax risk criterion is that it does ne! 
ad the independence of irrelevant alternatives axiom. Radner and 

r 4 . . 
a... illustrate this in terms of the above example as follows: Su’ 
cisa ae it Ce 
et oe if we are confined to rules of the form (n, n/2) (1¢™" 
ads : ots ae 
she ris ot only if the majority of the n tosses are heads), then the 
minimax risk criterion yields h : ations Witt 
Srrhability 0.9 and 21 as the optimal rule: 19 observation 
9 an : : ns rer, We 
Sitch observations with probability 0.1. If, howev* we 
y choose from all rules of the f i ricted (0 
n/2, then the minimax risk cr; e form (n, m), where m is not restric” 
Sci 0 X risk criterion suggests taking 37 observatiO! 
ity 0.2 and 39 observati : 2 + tg the 
interesting point) head ) ations with probability 0.8 and (ths ® " 
abit €ads should be adopted “¢ bonds appears ™ 
a majority of the observed tos eee aed only if fears f chet 
. SeS, Thus, in this case when we add q ric 
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(cet of states ofnature). ‘This may 
set of s 


practical examples it is not mather 
niques; however, it is not uncommon 


specific inference problems 
u(D’, s)<u(D”, s), 


(ie., D” strongly dominates D’) without ever 
values of u(D’, s), u(D’’, s) for any s. 

This observation suggests the following definitions which are employed 
by statisticians: 


i. A complete class of decision rules is a subset Dy of D such that for every 
D in D but not in Dg there exists a D’ in Do which weakly dominates D. 
[That is, 

u(D’, s) 2 u(D, s), all s in S, 


and > holds for some s]. A statistician has nothing to lose if he confines 
his attention to a complete class. 


ii. A minimal complete class of decision rules is a complete class such that 
no proper subset of it is also complete. 


Recall that a decision rule, D’, was said to be admissible if D’ was not 
weakly dominated by any other rule Din. In decision problems where 
the sets of terminal acts, states of nature, and outcomes of experimentation 
are all finite, it can be shown that the set of all admissible rules forms a 
minimal complete class of decision rules; however, in more complex cases 
this need not be so. Consider the trivial counterexample where there is 
exactly one state of nature, no experiment, and a countable infinity of 
terminal acts A, Ao, - - « ; let the utility of A4;be 1 — 1/7. Hence A) is 
weakly dominated by As, which is weakly dominated by As, etc. In this 
example, every act is weakly dominated by some other act, and so no 
admissible act exists. Obviously, there exist complete classes. For 
xample, {A;, Aj41, °° *} isa complete class, but so is {Aj41, 4j42, °° *}, 
and we easily see that there is no minimal complete class for this problem. 

Statistical analysis of a decision problem is usually broken up into two 
Parts, 

Part1. To ascertain the existence of a minimal complete class, and to 
characterize it; or, if no minimal complete class exists, to characterize a 

reasonably small’? complete class. 

Part 2. To select an “optimal” decision rule from a complete class. 

Although our discussion of various decision criteria has been directed 
waainly towards the problems of part 2, the bulk of current research pub- 
“Cations in statistical decision theory are devoted to topics in part 1. 
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with an infinite number of pure strategies (which are briefly discussed in 
Appendix 7). Sizeable portions of Wald’s Statzstical Decision Functions and 
Blackwell and Girshick’s Introduction to the Theory of Games and Statistical 
Decisions are devoted to (1) the existence theory for two-person games with 
an infinite number of strategies, (2) the relation of complete class theory 
to game theory, and, (3) the applications of 2 to classical statistical infer- 
ence problems. 
Recent contributions to the existence theory for games with an infinite 
number of pure strategies have had the peculiar effect of minimizing the 
importance of games in statistical decision theory. The mathematical 


techniques employed in these game theory papers can be applied directly 
to existence questions in statistical d 
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wishes tO investigate statistically prior to making any recommendatior 
Let us Suppose they are willing to employ the following model: For those 
who have and have not been vaccinated, let py’ and p ” denote, sind 
ly, the “true” probabilities that an individual chosen at random fron 
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tive b : som 
the population will contract the disease within a fixed period of time. 


Let do = po /ps. In practice it is quite possible that pj" 


may be epidemics of the 


vary from time period to time period, since there 
disease, etc., but for the purpose of this analysis let us suppose that their 
ratio 4‘ remains invariant. 

The following three problems are traditionally considered: 


i. To test the hypothesis that the true value \“” is greater than some pre- 
assigned quantity \* versus the alternative that it is smaller than or equal 
to A*. 

ii. To point estimate the value of \‘° in the sense of guessing, on the basis 
of a sample, a number which is “close” to the true value \“. 

iii. To interval estimate the value of \°° in the sense of guessing, again on 
the basis of a sample, an interval which has a “good chance”’ of containing 
the true value \ 


The solution to any of these problems is not usually an end in itself, but 
rather serves to influence a policy decision. Although it is true that the 
real world terminal actions which can be employed, the losses due to 
Wrong terminal actions, and the costs of experimentation are not explicitly 
introduced into any of the problems as formulated, such considerations 
will certainly influence some arbitrary procedural commitments which 
must be made to resolve such problems. This will soon be evident. 

Tn what follows we let E be a generic symbol for the strategy of experi- 
mentation, and we let D denote a typical decision rule which associates 
to each outcome of the experiment an appropriate guess for the particular 
Problem at hand. We consider the three problems separately. 

First, the testing of an hypothesis: Suppose a strategy pair (E, D) is 
. then whether we guess that \‘ > X* or not depends upon the 
en outcome of an experiment, anepahe prebability of that outcome 
a, (200) one upon the true states p;"’ and po : Of course, we do not 
kes i po ); nonetheless, by a probabilistic analysis, we can in 
(0) oe etermine the probability that Dy leads to the guess that 
i » Slven the assumption that (p;"’, py » is a specific pair of num- 
(Pi, py»). Symbolically, we can denote this probability as 


P(»;,p2)[guess that \© < \*, given (E, D)]. 


x When 1/p2 < \* this probability should be (close to) 1, and 
j bi/2 > \* it should be (close to) 0. By complicating E in one way 
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a called a significance level. Often a = 0.05 or 0.01 are used. Ore 
then demands of an (£, D) pair that 


P(p,,pylguess that A” < d*, given (E, D)] < ap, 


for all (fi, p2) such that pi/p2 2 A*, and that for certain specific pair 
(fi, p2) with pi/p2 < d* the resulting probabilities be “reasonably” large 
The more sophisticated problems of inference center about the choice o! 
E, rather than about the choice of D for a given £. 

When the testing-of-an-hypothesis procedure is used, presumably two 


possible policy actions are contemplated—one for the guess that \'” ¢\" 
and the other for \( > )*. 
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In classical estimation the following tac! taken. Fo > trip! 
[(p1, P2)s E, Di), let m[(f1, p2), E, D| denote the ‘aN (OY a +) of the 
p.d. f. associated with it, and let o[(1, p2), E, D] be its standard deviation 
Our hope, in this case, is to choose E and D so that 


{ml (p1, p2), LE, D] — pi/p2} and ol(f1, po), E, D] 
are both small. In case (E, D) is such that 
m|(p1, p2), E, D) = pi/p2, _ for all pu, po, 


the estimation procedure is said to be unbiased. For unbiased estimators, 
a reasonable index of the performance of (£, D) is the standard deviation. 


Probability density 


A=p,/P2 
Fic. 1 


More generally, a weighted average of {m[(p1, p2), E, D] — £1/p2} and 
ol(p:, p2), E, D] is used as a measure of performance. These classical 
Procedures do not explicitly incorporate into the formal model those initial 
Conditions which state the losses due to incorrect terminal decisions; one 
Must try to take such considerations into account by the interpretation 
Slven to the measures being “‘small.”’ This gets confusing in some situa- 
ag for example, in some cases it may be much worse to overestimate 
d”” than to underestimate it by the same amount. 
ore modern work attempts to take the consequence of erroneous ter- 
eg decisions into account. It is assumed that as part of the initial data 
a 8iven a function L, where L(f1, 2; A) represents the loss (disutility) of 
X for xO when A‘ really is pi/pe. Thus, if E leads to an out- 
ig Lp to which the estimator D associates the guess D( 0), then the loss 
1, 2; D()] when (4, p2) is actually true. To appraise (FE, D) with 
Wrong guesses, but not other factors such as the cost of experi- 
nN, we can use the expected value of the loss, i-e., L[p1, p2; D(O)] is 
ed by the probability (likelihood) that E leads to © when (fy, fo) 
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indeed, be a “poor guess” in actuality. A more sophisticated analysis 
should yield a probability statement concerning the deviations between 
the guess and the true value. From this knowledge, one can compute 
the amount of “confidence” to hold in the estimate. For example, sup. 
pose we know that, regardless of the true value (p{°, 5°), a strategy 
(E, D) will lead with a probability of 0.99 or higher to a guess which does 
not deviate from \‘” by more than 0.05; then this information can and 
should be exploited in policy decisions. It is such observations which led 
to the theory of confidence estimation. 

Let the rule D associate to each outcome © of experiment £ an interval 
of values instead of a unique guess for \‘°; this interval we may denote by 
D(©) (note the change in meaning for this symbol). ‘To be useful, such 
an interval should both be small and include A“. In much the same way 
as before, we can ascertain for each (3, p2) pair the probability that £ 
ae outcome to which D assigns an interval containing ae 
Re Property that for cach (Pu fe 
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i . a means that if a given experiment 99” 
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Josses due to wrong action, etc., may not have been thought through at 


time of €X 
jem, the confidence interval statement constitutes a p inary re] 
on the outcome of the experiment. Furthermore, to introdu 

, problem is not usually easy, SO the fewer actions that have to be con 
sidered the better; sometimes knowing a confidence interval helps t 
eliminate certain of the feasible terminal actions, thereby reduc: 
of the delicate and time consuming appraisals of lo 

To a growing number of statisticians, this defense of confidence inter- 
yals as a preliminary to the second stage of analysis seems weak and 
spurious. If a confidence interval is merely to be used as a tool to sum- 
marize experimental observation, many would claim that there are more 
informative summaries of statistical data which, when used by an expert, 
can be efficiently converted into action choices. Others hold, however, 
that, although often there are technically more informative summaries 
than confidence interval estimations, the latter are especially easy to 
understand, to assimilate, and to internalize. This point is usually 
shrugged off by the remark, “‘It is but a matter of training.” 

In summary, the essential difference between classical inference and 
modern decision theory is this: only in the latter model is a formal attempt 
made to incorporate the actual terminal acts and the specific economic and 
psychological losses attributable to wrong terminal decisions. ‘The 
modern work attempts, in a sense, to come closer to the real world prob- 
lem by introducing into the framework of the model more of the initial 
conditions. The classical work, on the other hand, left many of these 
considerations outside the formal model, only to incorporate them indi- 
rectly and informally via such concepts as significance levels, confidence 
levels, and lengths of confidence intervals. 

Thus, it is held, the modern theory is handicapped as an inferential tool 
In scientific research. How can a scientist realistically appraise the losses 
from falsely rejecting or accepting a research hypothesis? Or how can he 
evaluate the losses in estimating a parameter when this estimate may be 
used for a variety of purposes—some of which may be unknown or irrele- 
Vent to him? Since these evaluations do not seem possible, perhaps one 
should not assume them known. Furthermore, in scientific reporting the 
Problem often is to select an experiment whose results are most likely to 

© maximally informative in some sense, not to arrive at an explicit 
terminal action. In that event, it is argued, a classical approach is much 
More sensible. 

Decision theorists make several rejoinders. Even though the classical 
ie does not explicitly introduce losses, eventually they appear implic- 

» for otherwise how can one decide upon significance levels, confidence 
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13.11 SUMMARY 

An individual decision-making problem under uncertainty (d. p. u. u,) 
where the act and state spaces are finite, was formulated as follows: given 


an m by n matrix [u;;|, where u;; is the person’s utility for the consequence 
associated with act A; when nature is in state s;, find the subset of acts 
which are in some sense “optimal.” Of course, the intriguing problemis: 
what constitute reasonable criteria for optimality? 

If one can say that the individual has an a priori distribution over the 
aes of nature, the problem becomes one of decision making under risk. 
ee with by calculating the expected utility of the acts and 
the chapter with the -. ie n. * ae aie : vie 
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In section 13.3 we discussed requirement 
y refer to the individual’s knowledg: 
wa ; . . 
nature. Axioms 1 through 5, and to a lesse: 
ys and, so far as we are aware, all serio 
0 : 
them. These axioms are: existence (1 
formations (2); invariance under labeling of act: 


property (4); admissibility (5); and irrelevance of weakly dominated ac 

3 roo Aa Te if MEER 
(6). Axioms 1, a 2 7 ” and 7 were variations on the theme o ne 1nac- 
pendence of irrelevant alternatives. Axiom 7—a non-optimal act shall 
not become optimal through the addition of new acts—suffices to rule out 


the minimax regret principle. Axiom 8, which implies that a constant 
added to a column shall not alter the optimal set and, together with 3 and 
7, implies that the optimal set shall be anticonvex, rules out the maximin 
utility (minimax loss) criterion and all the Hurwicz a-criteria. For the 
Hurwicz a-criteria, Axiom 9—convexity of the optimal set—implies 
a = 1,i.e., the maximin utility criterion. Axioms 1, 3, 4, 5, 7’, 8, and 9 
imply that there exists an a priori distribution which is independent of any 
new acts that might be added and which has the property that any act 
which is optimal for the problem must be best against this distribution. 

The notion of complete ignorance, which was bypassed earlier, was 
captured in section 13.4 by two axioms: invariance of the optimal set 
under labelings of the states of nature (10) and under deletions of repeti- 
tious columns (11). Chernoff has shown that the criterion based on the 
principle of insufficient reason is characterized by axioms 1 through 10. 
The Hurwicz a-criteria, modified by adding an admissibility requirement, 
satisfy all the axioms save 8 and 9, and they are the only criteria satisfying 
axioms 1 through 4, 7’, 10, and 11 plus a continuity requirement for simple 
choices, 

If we demand that a criterion completely order all of the acts instead of 
selecting an optimal set, then the analysis is much simpler. We presented 
Milnor’s axiomatic analysis, which pithily characterizes such criteria (see 
chart on p. 298), 

hat is known of the no man’s land between complete knowledge and 
“omplete ignorance was the topic of section 13.5. If one endorses 
ee 1 through 6, 7’, 8, and 9, then, as we have said, all optimal acts are 
spinal ee GieStaaiicilawa priori probability distribution over the 
i S of nature. Following Savage, we indicated how this distribution 
iba: Senerated from consistent answers toa hypothetical list of simple 
ie We also mentioned suggestions of Hurwicz and of Hodges and 
n for coping with partial ignorance. 
inte © appropriateness of these axioms when nature is replaced by an 
Sent adversary was explored in section 13.6. Some were not 
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If an a priori probability distribution over the states ” a at ow : 
then experimental evidence merely alters it (by means of a aa 
after the experiment, any act is chosen which is best against the new—the 
a posteriori—distribution over the states. If an a priort distribution is not 
known or assumed, then the analysis is a bit more involved. Instead of 
choosing among terminal acts, we now choose among decision rules, 1-¢., 
among rules which associate to each possible experimental outcome a 
specific act. ‘The consequences of any decision rule-state pair is a well- 
defined lottery, and so it is evaluated by its expected utility. This proc: 
essing of the experimental possibilities reduces the statistical decision 
problem to the previously considered problem of decision making under 
uncertainty. 
If several experiments are available, the selection among them, as well 
as the choice among actions, can be incorporated into the decision ™® 
Of course, the consequence now of any decision rule-state pair is 4 more 
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Group DECISION MAKING 


14.1 INTRODUCTION 


Democratic theorists, economic as well as political, have long wrestled 
with the intriguing ethical question of how “‘best’’ to aggregate individual 
choices into social preferences and choices. In this chapter we shall sur- 
vey some recent formal contributions to this topic. 

he next three sections of the chapter are devoted to describing Arrow’s 
asic work in formulating a problem of social choice and to stating his 
Central (negative) result. This theorem—the voting paradox—is exam- 
Ined at some length in section 14.5, and various schemes for avoiding it are 

“ctibed. One class of schemes—those which utilize individual strengths 
of Preferences—is of such importance and so thoroughly interlocked with 

€ unresolved question of interpersonal comparisons of utility that we 
Ve treated it separately in section 14.6. The next two sections deal 
with two aspects of majority rule: conditions restricting the possible indi- 
Proce ferences in such a way that majority rule is a plausible voting 

° and its game-theoretic features. The penultimate section is 

to games of “fair division,” and the final one is a summary. 
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we shall examine. As it is a basic et 16 followin 
three sections and since we shall concentrate on!) aspect 
Arrow’s work, we urge the reader to go to his volt - ical Weis 
ground material and for guidance through the litera e have taken 
the liberty both of altering his notation slightly « making minor 
modifications in his formulation. 

In its most general terms, the problem is to define “‘fair’’ methods fo 
amalgamating individual choices to yield a social decision. As interpreted 


by Arrow, this becomes a question of “combining” individual preference 
patterns over various states of affairs to generate a single preference pat- 
tern for the society composed of these individuals. Some of the more 
common methods or procedures for passing from the preferences of indi- 
viduals among social alternatives to a preference for the society are: con- 
vention, Custom, religious code, authority, dictatorial decree, voting, eco- 
nomic market institutions, etc. Not all of these are usually considered 
fair and representative schemes, so part of our task will be to decide what 
we might mean by a procedure which takes into account the welfare 0 


the members of society, i.e., by a “welfare function.”” We may best indi- 
cate those aspects of the problem we wish ? 
some very simple exam 
the next section, 


to abstract by first considering 
ples; this will lead into the general formulation 0 


Consider a society of two ind 
ences for the two 


(2) prefer x to Ys 
These three Cases 
reason for this not 


ividuals, 1 and 2, each of whom have prefer- 
Possible alternatives x and y. An individual can either 
(0) prefer y to x, or (c) be indifferent between * and J: 
are designated by R}, R®, and R®, respectively. (The 
ation will appear later.) Schematically, we have: 


z 
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y ; 4 
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alternatives ‘s ab “ ef t € set of possible preference relations over : 
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the members of this society. Ihe set of ordered pair 


TABLE 14.1 


Re R$ R3 Ri R3 Ri 


The first two columns of Table 14.1 list all of the elements of R XQ, i.e., 
all the possible ordered patterns of preference between two alternatives for 
a society of two individuals. The third column, labeled Fj, associates to 
each element of R XR an element of R. Thus, the third column exhibits 
one method of amalgamating the individuals’ choices into a social prefer- 
ence pattern. For example, procedure F; sends (R?, R°) into R?, and the 
interpretation is this: if 1 prefers y to x and 2 is indifferent between x and y, 
then rule F; “combines” these choices into a social preference for y over 
*. Columns F>, F3, F4 represent other possible procedures for passing 
from tastes of individuals to the choice for the society. Procedure 2 can 

© characterized as an imposed procedure since the choice of the society 

oes not depend upon the choice of the individuals. In procedure F3 the 
oo of the society depends only upon individual 1’s choice and not upon 
ping we may term it dictatorial. F4 does not seem a reasonable pro- 

€; nevertheless, it is a method of amalgamation. 
% pee case, the F; represents a function with all of R X@ as its domain 
baie Me and with ®asitsrange. Naturally, there are other functions 
to ®; indeed, there are 3° = 19,683 such functions. Most of 
le like P. 4, are more appropriately referred to as “‘illfare”’ 
intuitivel n “welfare”? functions. This raises the problem: what can one 
Y mean by a “welfare” function? 


Decision Making tg 


4 


Group ire | indivia 
a for example, the following should neWidug 
: r ; uld h MMOon pre 
Consider, : preference, then society Ss . AMON pref 
es atlve ne re WUire 
have the sam our two-person two altern  , | Teauite 
erence. For Ri R}) — R}, (where the — | sent into 
to: m ti ber of pncci} 
ment amounts (R® R°) —_ R?. It cuts een : of POSSiD]e 
ek xR into® from 3° to 3°. Ho ve accept thi 
functions mapping ® n also argue that it is reaso © to insist that 
ae 7 a ) 7 
restriction, then we cal ° R2. First, note that (4°, &°) results from 
1 R) not be mapped into aative x. If society wa 
(RR) 0 hanging his mind in favor of alternative x. If society was 
3 ngin : fe Go 
ieee ) by t changns een x and y (which follows from the first require. 


eageally — iety to change in favor of y when 2’s choice 
) aeeaehell aati of x. It is by “‘reasonable”’ conditions 
ae a4 a Bi to eliminate many of the initially possible 
al  ccry of “welfare”? functions. In section 14.4 the 
ill be given. 
a ie idea of what we shall be doing, let us it: out 
several aspects of the general problem that we shall not dieens : ee 
pose that F, is the method of social choice used in the society tee : 
1,2. If 1 prefers x to y he will certainly not state this, ie., 2, for ihe 
ensures R! is not chosen by the society. He will register either R* or R 
If he thinks that 2 will say R’ or R?, he will choose R’; if he thinks that ¢ 
will say R*, he will choose R®. Such strategy aspects of the problem will 
not be discussed. We will assume that the choices of the individuals “ 
part of the data of the problem. Alternatively, we might try to need 
condition on the function to the effect that it never benefits an individua 
to misrepresent his actual tastes; however, this idea seems to be very ‘if 


cult to formalize, and no attempt will be made to use it. 
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4, 
| ities, and we prefer first to analyze the problem of social choice without 
complications. Later we shall consider how the model change 
a such features are introduced. / | 
Returning to our problem, one can gain a sense of its full complexity by 
examining three alternatives and a society of three people. Let @ cons st 
of three alternatives labeled x, y, and z,i.e.,@ = {x,y,z}. All the possible 


preference-or-indifference relations on @ are: 


TABLE 14.2 


por Rk R* (DE a ad S ite ke he Re nS 
moe. y 2 2 x y Zz x VE: ZamWime Lie Mim Wy Be 
nent oY yo ew X—- ZK y z y i 


y & 
| 

Thus in R°, for example, y is preferred to both x and z, and x and z are 
indifferent. Note that, for every pair of distinct elements uw, v belonging to 
@ (i.e., u is either x, y, or 2, and v is either x, y, or z), either u is preferred to 
yor v is preferred to u or uw and v are indifferent. Further, preferences are 
never allowed to be intransitive. Of course, in actual practice an indi- 
vidual may have intransitive preferences (e.g., x preferred to y, y preferred 
to z, z preferred to x), in which event this model does not apply. One 


way to avoid intransitive responses is to insist that the individual choose 
an element from the set 


R= ae, RR. RP, Rasim ig agai 
i¢., rank-order the alternatives. 
As before, to each ordered triple of elements of ® there is a correspond- 
Ing pattern of choices for the individuals, and conversely. ‘Thus to the 
triple (R*, R®, R?) we have: individual 1 prefers y to x to z; 2 prefers y to x, 
) to z, and is indifferent between x and z; and 3 prefers x toz toy. Thus, 


a function F from ® XR XR into R can be interpreted as a procedure for 
Passing from the individuals’ preferences to the social preference. 
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As the number of alternatives and people are increased beyond three it 
a more and more cumbersome to exhibit all the possible preference 
i, on the alternatives. Therefore, a simple, compact terminology 

Pplicable to all cases is needed. The following is suitable: 


i Alternatives. et @ = {x,7, °° - , z} bea set of alternatives. 
> ¢ Individuals. Let the individuals of the society be denoted by 1, 


ry . 
tae. ? Cy Ea 
N. Preferences. 


and For each individual z and any alternatives u and v, one 
Only one of t 


he following holds: 


on Making 


a E IMs 
» which is written as uly, 
(a) “t prefers w to 2 which voila o. 
«* nrefers v to u,” which 1s written as vf ju, 
a, dv,” which i pee 
(c) “iis indifferent between uv ane ", A is 3 uly, 
5) = D. al 
If “i does not prefer v to wu” we write is Equivalent to 
so demand that ¢éa ividuan’ 
either uP or ulw holds. We al i. lual be Con. 
sistent in his preferences 1n the sense that £;, /;, each assumes 
to be transitive. ) 
We have already seen that for two alternatis es are three possible 
preference orderings and for three alternatives there are thirteen possib}, 
i = {Ri pR?... 
preference orderings. In general, let R = {R, K’, , RB! be the 


possible preference orderings of the alternatives, where m depends upo 
and increases rapidly with the number of alternatives. By a profile 
preference orderings for the individuals of the society, we shall mean " 
n-tuple of orderings, (Ri, Ro, ° °°, n), where &; is the preference order. 
ing for the ith individual.! The set of all possible profiles of preference 
orderings will be denoted by R” = RXRX--- XR. 


iv. By a social welfare function (or “‘constitution,’’ or “arbitration 
scheme,” or “conciliation policy,” or “amalgamation method,” or “voting 
procedure,” etc.), we shall simply mean a rule which associates to each 
profile of preference orderings (i.e., to each element of ®“?) a preference 


ordering for the society itself. If F denotes such a rule we shall sy- 
bolically write: 


F 
(Ri, Ro, ee Ry) —e R, 


which means that rule F “combines” the profile of orderings (Ry Rx 


** +, Ry) to yield the ordering R for the society. 


a Nori oe vi many conceivable functions from ®? to &; it 
requirements saan € mathematics as a social problem, there eee 
ically” acceptable ee tld Satisfy before we would consider e. ye 
purely subjective " ne requirements of ‘ethical acceptability ¥2) 

valve Judgments not subject to mathematical derivatio” 


however, j 
» NM a mathemati esi’ 
€rata have to be phr cal treatment of their implications such 


We aia ac. on as explicit Mathematical statements. _ wah 
versal ethic, or even Sin to imply that there does or should exist fe 
invariant over diff, at an individual’s “ethical standard” sh, 

erent social Choice problems. When we assert is? 
nn on F is “reasonable.” all that is imP mal 
subscribe to the a some situations for which Saough sndividua 


nditi ication® 

* Note the different ro} ns to warrant investigating their implica” ne? 
relation in the listin le of subscripts and su : < the ith Pr cles fot 
Dereon 73 & RK of all of them Perscripts. * means ce rela io? 


3 Whereas R; means the preferen 
idual preferences. 


mum (NY 


Ae Arrow 
Imposing conditions on F of course 1 
functions. Arrow has shown that som« 
ments narrow F down to the point whe 
However, he has also shown that, if the domain o! 
restricted (single-peakedness condition, 
desirable welfare functions do exist—sociall 
sense of satisfying the stated prerequisites. 
Summarizing, the procedure to be adopted is as follows: There are 
many specific social welfare functions which are well defined; however, 
particular functions are often adversely criticized because they fail to 
satisfy some “socially desirable” criteria. Hence, instead of considering 
specific functions, we shall attempt to capture our intuition of “‘desirable”’ 
by explicitly stating properties that any social welfare function should 
satisfy. We shall examine a set of conditions, each of which individually 
has merit as well as a long and illustrious history, but which collectively 
are inconsistent, i.e., no social welfare function exists which fulfills all these 
conditions. 
Assuch a conclusion may seem odd, the following example should serve 
to suggest why it is not so trivial to find “reasonable” welfare functions 
Simple majority rule. Corresponding to each profile of preferences, let 
society prefer the alternative u to the alternative v if and only ifa majority 
of the individuals prefer u to v. The following profile of choices for 
alternatives x, y, and z illustrates the difficulty with this rule: 


R} R* R® 
+ y Zz 
y ve 

rh x y 


Since a majority of individuals prefer x to y, y to 2, and z to x, society 
selects the intransitive relation: xPy, yPz, zPx. Consequently, the rule 
es not tell us what element of & to associate to (R', R*,R*). To be sure, 
We could alter the simple majority rule to let (R’, R’, R*) map into R™, ie., 


ae but then we would have a different rule—one which has other 
ig, 


1 
4.4 CONDITIONS ON THE SOCIAL WELFARE FUNCTION AND 
ARROW’S IMPOSSIBILITY THEOREM 


a a where the number of alternatives is one or two can be easily 
a. ly. Indeed, for one alternative no analysis is necessary! 
al eo shall confine our attention to situations having three or more 
& es. The number three plays an important role, since intransi- 


Wities 
Can only occur on three or more elements. 


Making 
[14.4 
rule led to an 1 
Id be of no impo 
tha 
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It is true 
profile (R y 


that simple majority 
R R®), but this wou 
bers if we had reason to believe 


m me 
— occur. For then we need not den Metion 
ee eeiiof G@”, but only on a certain res FRO op 
smaller the domain, the easier it is to constru ble” Or 
functions. In the extreme when only complete agi illowed, 4 
task is trivial. Some groups, possibly because ot s ion or ee 
of a common ethic, do not often exhibit a wide div of opinions, Ne 
for such societies majority rule probably never will arrassed by fe 
intransitive set of orders. For more discussion of th int, see section 
14.7. 

We choose, however, to confine our attention to the mathematical} 
more interesting case where the domain of the welfare function js ital 


enough to make our task formidable. For ease of presentation we shall 
assume that the domain of the welfare function is all of ®™: i.e,, we 
require the function F to be defined for all conceivable profiles of indliibiel 
orderings of the alternates. Actually, the other conditions that will be 
imposed on F continue to be incompatible even when the domain is 
restricted to certain proper subsets of ®™). However, certain of the 
one domains given in the literature (Arrow [1951 a] and Inada [1954, 

>]; Go not allow one to establish the contradiction asserted, as has been 
pointed out by Blau [1957]. 

We summarize the above discussion as: 


Condition 1, 
or equal to three. 


(6) The social w / : : 
orderings, elfare function F is defined for all possible profiles of individual 


Di} : 
(4) The number of elements (alernatives) in @ is greater than 


(c) There are at least two individuals 


d for th 
es ” : 
€cond condition we consider a special example, 


. 2}, and let . 
$ the so : 
ider the pattern (Ri Pll piz ciety have exactly three members. ° 

> b] R )y namely: 

R! 

Ru 
R}3 
ie Pe 
y F et id 
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and suppos 
€ that F, ; 
for t olsaw 
he Profile (Ri, RU oo function such that society prefers J a8 
2 . 11 ; 


Namely: ce) i : 
W consider the profile (R's ® » 


1l 


Bs 


Arrow’s imposs1bDilt 


14.4] 


: fentt pls) 3. mcdifecd | ; 

If the first profile, (R’, R'!, R*°), is modified | and t} 
bers’ pushing y Up while keeping x and z fixed, then the secon 
in”, a R°), results. It thus seems ‘‘reasonable”’ 


tl. a+ yee ee lantea 4 
Lilal, ICE £9 sCsletls 


in preference to 2 for the former profile, it should also do so in the latter 


one. Stated alternatively, a social welfare function is not “reasonable” if, 

when the members choose (R}, R11, R**), society prefers y to 2, and when 
bs . _ 

the members choose (R*°, R11, R8), society does not prefer y to z. 


Condition 2 (positive association of social and individual values). Jf 
the welfare function asserts that x is preferred to y for a given profile of individual 
preferences, tt shall assert the same when the profile is modified as follows: 


(a) The individual paired comparisons between alternatives other than x are not 
changed, 


and 


(b) Each individual paired comparison between x and any other alternative either 
remains unchanged or it is modified in x’s favor. 


To arrive at the next condition, consider the case of four alternatives 
w, x, y, z and suppose that for some profile of individual preference a 
specific welfare function Fo states that society prefers alternative x to 
alternatives w, y, and z. That is, given the particular choices for the 
individuals, the rule Fo says that x is the “best”? (most preferred) alterna- 
tive from the set {w, x, y, z}. 

Now suppose we restrict their consideration to the alternatives {x, y, z}. 
Presumably, the individuals might change their preferences among x, y, 2} 

ut suppose they do not! If all paired comparisons made by the indi- 
aa ern elements Xs Jy 2 do not change, then isn’t it “reasonable” 

: . at, since x is socially best in {w, x, 9, z}, it is also socially best in 
ee To be very concrete, let us quote Arrow [1951 a, p. 27] on the 

er method of voting used in clubs: 


pets finite number of candidates, let each individual rank all the candidates, 
> ie first choice candidate, second choice candidate, etc. Let pre- 
the ee ts be given to the first, second, etc, choices, the higher weight to 
Votes be = Oice, and then let the candidate with the highest weighted sum of 
candidates ected. In particular, suppose that there are three voters and four 
choices be He z,andw. Let the weights for the first, second, third, and fourth 
candidates 2 ae and 1, respectively. Suppose that individuals 1 and 2 rank the 
5 eae n the order x, y, 2, and w, while individual 3 ranks them in the order 
is deleted re Under the given electoral system, x is chosen. Then, certainly, if 

rom the ranks of the candidates, the system applied to the remaining 
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hould yield the same result, espec ial; 
fis astes of every individual; but, H 
would yield a tie between - 


candidate 
to x according to the t 
cated electoral system 


s ) etl I 
As another example, let @ = ay, z}. | 
R}° R} 
a ‘i 
A yy 
and 
R° R? 
z ms y 
“ae z ae » 
y 


the preference relations between x and y are identical for each individual 
so the argument is that for both profiles society should reach the same 
choice between x andy. ‘The counterargument, however, notes that when 
2 changes from 

x 
to Zz 

By 


he seems to be indicating that he prefers x to y “‘more’”’ in the latter thanin 
the former pattern. 


NS 


ee That is, alternative z is not irrelevant in appraising the 
strength” of preferences for x versus y, and so it is not “unreasonable” for 
society to prefer y to x in the first profile and x to y in the second. In 
il voting example, the claim is that when w drops out of the race the 
| ane the preferences of the individuals is changed, eve 
ie Sikes aioe Judgments remain constant. ouitedil 
further illustrates ee # taken from Goodman and Markowitz ie , 
willing to serve either eee “ host has two dinner guests to ‘a nid 
each prefers, coffee or ae tea but not both. Instead of pee 
Miia. » this subtle host gets them to rank a whole 
profiles of responses are: 


1’s 
Preferences 2’s Preferences 


Coffee Tea 
ag Coffee 
Profile 1 ey Rie 
Hot chocolate Lemonade 
ee vela Coca-Cola 


Hot chocolate 


14.4] 


and 

1’s Preferenc 

Postum 

Milk 

Coffee 

Profile 2 «Tea 

Lemonade chor 

Hot chocolate Mil 

Coca-Cola Coffee 
For profile 1, the host deems it fair to serve coffee and for profile 2 toserve 
tea. He reasons that the other drinks are not irrelevant, and their intro- 
duction permits him to appraise the relative intensities of preferences for 
coffee versus tea. 

Yet, is such reasoning really plausible! One can argue that this pro- 
cedure introduces a notion of interpersonal comparison of utility, and, 
if it is desirable to do that, then this is surely a naive way of doing it. 
Indeed, if such a statement as “1 prefers coffee to tea more than 2 prefers 
tea to coffee (in profile 1)’ has any meaning at all, then an alternative 
rationalization of profile 1 can be given to show that the host should have 
served tea, namely: 


o 
& 
2834 
i ea 
& &£ # € 2 § "s 
eos § to ek 
1: el | |_| 
+ utility — utility 
Y 
ae ely 
Lio een! 
Sisal 5 § 38 
= eee gt a ger 
y Ree itge ny > Saas 
= oe re a oR 
a | oko bel me Mei eee 
er piility — utility 


Havi ‘ , ; 
eh argued both sides of the question of the independence of irrele- 
itio . ternatives, we would claim that it is a sufficiently “reasonable” con- 
to warrant an investigation of its implications. 
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‘aad identical in both profiles, the society’s ordering 
alternatives x and y, Say, are 


x versus y should be identical for both profiles. 


i dition is extremely powerful. For example, suppose that we 
Be a ii ls of a two-person society symmetrically in the sense 
fe 5 Pct ws iy z and the other z to y, society shall be indifferent 
, ; ety ict us try to arrive at the social ordering for the profile: 
etween y : 


1 2 
x Zz 
y x 
Zz y 


. hc reler 
Since both individuals prefer x to y, it is plausible that society fee ; 
x toy. On the other hand, for x and z, y is irrelevant by i Beales 
the symmetry assumption requires that society be peaiiteren cee 
and z; similarly, x is irrelevant when comparing z and y, so ‘abe ansitit 
to social indifference between z and y. But indifference is a 


- conc 
° ci SOG rary to our 
relation, so society is indifferent between x and y, contrary 
sion above that society prefers x to y. 


, ands 
“ue eae E ves x ant 
Condition 4 (citizen’s sovereignty). For each pair of oe € 
~ . ws 2 * ‘ 0 ds 
there is some profile of individual orderings such that society prefers x 1 J 


«fad: the! 
Consider any welfare function for which condition 4 is not at to) 
there exists a pair of alternati 7 
regardless of the preference ord 
the citizens of the society do n 
the pair x versus y, 
preferred to y” 


; refe 
ves x, y such that x is not P! 
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; co Oe other 
erings of the individuals. In pect 
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is imposed. 
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5x to y (for any x and ) society does a 
the preferences of other individuals, : 
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; ne welfare function does Not satisfy condition 5, then this aa vali t0 
matvidual is a dictator in the sense that if he prefers one 4 te 


14.4] Arrow’s Impossibility Theorem 339 


another sO does the society; the rest of the com! tunity ha 

society's choice between two alternatives only if he is indifferent between 

them. 
Arrow’s impossibility theorem states that conditions 1, 2, 3, 4, and 2 


are inconsistent. That is, there does not exist any wel fare 
possesses the properties demanded by these conditions 
nately, if a welfare function satisfies conditions 1, 2, and 3, 
imposed or dictatorial. 


> The central steps of the proof can be outlined without too much rigorous argu- 
ment. With respect to a given social welfare function a subset V of individuals is 
said to be decisive for the ordered pair (x, y) if whenever the members of V each 
prefer x to y society does likewise—regardless of what members not in V have to 
say about x versus y. In other words, if V is decisive for (x, y), then the coalition 
V can always enforce x over y by having each member express preference for x 
over y. 

By condition 2, a set V is decisive for (x, y) if and only if society prefers x to y 
when all of the members of V prefer x to y and all other individuals prefer y to x. 

From conditions 1, 2, 3, and 4 it is easy to prove 

Pareto optimality: The set of all sndividuals is decisive for every ordered pair (x, y), 
ie., if each individual prefers x to y, so does society. 

Arrow felt that conditions 2 and 4 were more basic than Pareto optimality, so 
he chose not to introduce it as a basic axiom. Later, Inada [1955] attempted to 
show that Arrow’s conditions 1 and 3, Pareto optimality, and a slightly modified 
but equally innocuous version of 5 also lead to an inconsistency, but, as Blau 
[1957] points out, his formulation possesses the same slight flaw as Arrow’s. His 
result is, however, true with our version of condition 1 substituted for Arrow’s, and 
it is also true with our condition 1 somewhat relaxed, but not to the extent done in 
these papers. Another formulation, based directly on decisive sets, is given by 
Weldon [1952].2 

Arrow’s fifth condition can be rephrased as: No individual is decisive for every 
ordered pair of alternatives. 

A proof of Arrow’s impossibility theorem is as follows: 


1. Suppose V is a minimal decisive set, .e., it is decisive for some x against some 
y, whereas no proper subset is decisive for any ordered pair of alternatives. Such 
4 set must exist since, as we mentioned above, the set of all individuals is decisive 
and the individuals can be removed one at a time until the remaining set is no 
Onger decisive for any pair. ‘This proves the existence of a minimal decisive set, 
eo cannot be the empty set since otherwise its complement, the set of all 

‘viduals, would not be decisive for some pair. 
ae j be a specific individual in V, W the remaining individuals in V, and U 

t of all individuals not in V. Since the society has at least two individuals, 
ey not both be empty. Let z be any third alternative, and consider 
Owing profile of orderings: 


2 
Soa must be taken with Weldon’s paper, for it appears as if Arrow’s theorem is 
‘i ee without using the condition of the independence of irrelevant alternatives, 
in fact it is used in the proof. 
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i i isi inst z,xPz. Thus, by 
By Pareto optimality, wPx, and, since {7} is decisive for x ae "betel 
transitivity, wPz, so {j} is decisive for w against z. Next, c 
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Since {j} is decisive for w against z, wPz, We 


fore, by transitivity, wPx, so 
have therefore established t 
But this is impossible, so n 


and, by Pareto optimality, ee 
{7} is decisive for w against x, as was to be Bs dictator 
hat {7} is decisive for all pairs and so Te 

© function exists which meets the five condl 
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What are the ramificati 
choice problems still arise 


must 
be side-stepped, 


to 
es ibilities: 
€¢ it, there are two distinct possibili “tio 


— 


Discussion of 


14.5] 
nd will be better discussed under (ii). 
a 


sider several proposals which fail to meet ths 
alternatives condition, but which do add 
jem Arrow considered. 

To illustrate once again a possible obj« 
axiom, consider the case of two individuals in r 
positions” and let their profile for two sdokide setts 


Individual 1: Ai preferred to Ae 


Individual 2: A2 preferred to A 
and suppose the welfare rule says A, and Az are indifferent for society. 
Now let us add the following alternatives: $1000 given to each, $1 given 
to each, —$1 given to each, — $1000 given to each. Suppose with these 
new alternatives included, the preference profile takes the form: 

Individual 1: ($1000 each) preferred to ($1 each) preferred to A, pre- 

ferred to Ao preferred to (— $1 each) preferred to (— $1000 each). 

Individual 2: A» preferred to ($1000 each) preferred to ($1 each) pre- 

ferred to (—$1 each) preferred to (—$1000 each) preferred to Ai. 

In a situation of this kind one might be inclined to say that these new 
alternatives are not irrelevant, and 7f we could believe 1 and 2’s orderings, 
then a “welfare” rule should choose A» over Aj for this society. If we 
reject the axiom of the independence of irrelevant alternatives, we Can 
inject judiciously chosen hypothetical alternatives into the picture to 
serve as a base line for evaluating comparative strengths of preference. 
However, if the members of society realize that these extraneous alterna- 
tives are not really feasible, it will behoove them to lie about their true 
tastes and to play a game of strategy. 

As to reforming the problem itself, there are several ways to proceed. 


i. Restrict the domain of the welfare function. Arrow required a social 
solution for every profile of individual preferences over three or more 
alternatives, no matter how chaotic it might be; possibly we should 
accept a less universal rule. For example, we might assume that there is 
some underlying structure to preferences which prohibits extreme diver- 
gences of opinion. This hedge, in essence, weakens Arrow’s condition 1, 
but in so doing it really changes the problem. We will return to this 
topic later (section 14.7). 

ii. Weaken the demands on the range of welfare functions. Arrow demands 
that both the individuals and society completely order all the alternatives. 
Wouldn't it suffice to have society choose only one or a few alternatives 
as optimal?” (Recall that in the previous chapter we concentrated on 
oe optimal sets, not on finding a complete ordering of acts.) Arrow, 

€ understand, did at first phrase his impossibility theorem in terms of 
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easily found for any subset of the alterna | Y event, since 
Arrow’s axioms are still inconsistent when translated into choice set 
terminology, we cannot avoid the impasse merely by this expedient, 
iii. Obtain more data on individual values. Had we demanded from each 


individual only his optimal choice and not his whole ranking, there would 
have been no difficulty in amalgamating these into a social choice. At the 
other extreme, instead of asking each individual only to order the alter. 
Natives, we could ask him to order all of the orderings of the alternatives 
For example, with three alternatives there are 13 orderings, and each 
individual could be asked to order this set of 13 orderings. Such addi- 
tional information enables one to extract some data about the strengths of 
an individual’s preferences among the alternatives. (Incidentally, if 
individuals ordered orderings, then a socially optimal set amounts to 
social ordering of the alternatives.) Or, looked at another way, if society's 
terminal action is an ordering of alternatives, then it seems reasonable 
to ask each individual to rank society’s terminal actions (i.e., each individ- 


ual should give an ordering of orderings). But even if this complication 
1s introduced, Arrow’s axioms can be 
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of Chapter 43), and let s1, 52, °° * , 5n be © individu 
(these play the role of the “‘states of nature” of Chapter 13). Gonsider tnt 
payoff array 

Sj ae a S 


Ay 


A m b] 

where u(A;, 5;) is a number which, in comparison with other numbers in 
the same column, reflects something about s,’s preference for Aj. In 
attempting to determine how much significance to give to the numbers of 
the payoff array, it is helpful to decide when two arrays are strategically 
equivalent. The m numbers u(A1, 5;), °° * ; u(Am, 53) of column j reflect 
s;’s preference for alternatives in the sense that the higher the number, the 
more preferred the alternative; ties reflect his indifferences. Hence, each 
payoff array induces a profile of individual orderings. Now, if we wish 
to abstract away completely the notion of preference strengths, as Arrow 
does, then we should treat as equivalent two payoff arrays which induce 
the same profiles of individual order preferences—two such arrays we will 
call order equivalent. Any strictly monotonic transformation of the numbers 
in any column yields an order equivalent array. 

For payoff arrays, Arrow’s conditions can be paraphrased as follows: 


Condition 1. A social ordering shall be associated to every possible array; 
m23,n>2. 
Condition 2 (positive association of individual values). If a given array 
Ch modified by adding positive quantities to some entries in the ith row, and if society 
originally preferred A; to Ax, it shall do likewise after the modification. 

Condition 3 (independence of irrelevant alternatives). Adding new 
rows shall not change society’s ordering for old rows. 


There is a clear analogy between the axiomatic development of decision 
oy given in Chapter 13 and the present discussion of welfare axio- 
ae The (act, state of nature) pair of the decision problem under 

inty is replaced by the (alternative, subject) pair of the welfare 
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With these assumptions, we can summarize the profile of individual 
preferences for lotteries in terms of an m by n array, where the entr) of the 
‘th row and the jth column is u(A;, s;). But since each subject’s m utility 


numbers are defined only up to a positive linear transformation (i.e., the 
origin and unit of measurement are not determined), two arrays should be 
treated as equivalent, utilitywise, if each column of one can be obtained 
from the corresponding column of the other by such a transformation. 
We can now see one tremendous difference between the social choice 
problem and the decision-making problem under uncertainty. In the 
latter problem it is meaningful to compare the utility for the (act, state) 
pair (Ay, 55) with the utility for the (act, state) pair (Ae, 54). However, in 
the former problem, comparing w(A2, 55) with u(Ag, 54) is meaningless (as 
it now stands!) since this involves an interpersonal comparison between sub- 
ject 5 and subject 4. 

It can be shown (simply by modifying the Blackwell and Girshick 
theorem mentioned in footnote 3) that conditions 1 and 3, Pareto opti- 
mality, and invariance under individual utility transformations, again lead 
to serial dictatorship, so it is clear that, if we are to exploit utility preferences 
of individuals, some inner bond must be hypothesized between the utility 
scales of different subjects. But how can this be meaningfully accom- 
plished? We turn in the next section to some proposals. 


*14.6 SOCIAL CHOICE PROCEDURES BASED ON INDIVIDUAL 
STRENGTHS OF PREFERENCES 


Social choice procedures which attempt to reflect individual strengths of 
preference (not merely rankings) must eventually contend with the inter- 
Personal comparison problem. Either a commensurating unit, or a base 
of reference, or both have been suggested. Goodman and Markowitz 
[1952] suggest an operational procedure for making interpersonal com- 
aa of differences of individual strengths of preferences. Nash’s work 
Sisce argaining problem can be generalized directly to yield a social 
Saeed for groups of exbitvary size. In this generalization no 
i tas comparison of utility is made, but the bargain situation has 
Wlizes i ase of reference—the status quo point. Hildreth [1953] also 

near utility scales to capture strengths of preferences; but, unlike 
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Conditions (i) and (iv) together state that the social ordering 

mined by society's choices for paired compari: 

condition (v), the part about the labeling of altern: pe 


‘qnocuous, but the part about invariance under labeling of individual 
major assumption. This 1s an egalitarian, symmetry, or democrac) 
assumption which may or may not be applicable dependi 
context of the problem. 

From these five requirements it can be shown that society must order 
alternatives strictly according to their average payoffs—or, what opera- 


tionally amounts to the same thing, society’s “utility” for an alternative is 
the sum of the individual “‘utilities” for that alternative. Note that the 
Goodman-Markowitz requirements are mathematically identical to 
Milnor’s axioms characterizing the Laplace criterion (i.e., the criterion 
based on the principle of insufficient reason) in the decision-making prob- 
lem under uncertainty. 

Goodman and Markowitz do not spell out in sufficient detail just how an 
individual can determine experimentally the number of distinct indiffer- 
ence levels between two specific alternatives. One simple way is to 
employ, at the outset, a “sufficiently rich” set @ of alternatives which 
includes, among others, all the alternatives that the individuals and 
society will be asked to rank. Each individual can then assign integers 
to the alternatives of this canonical set as follows: the numerical assign- 
ment to A, is exactly one unit higher than that to A, if and only if both 4; 
is preferred to A;, and there is no alternative in @ which is more preferred 
than A; and less preferred than Aj. Any specific social choice problem 
will then involve a small subset of available alternatives from @, but each 
of these will have a specific index for each individual. 

It is difficult for us to see how the set @ can be initially defined. How- 
ever, even if we assume, for the time being, that operational numerical 
assignments to alternatives can be effected, there still remains the real 
question: Does the number of “discernible discretion levels” between a 
pair of alternatives really measure strength of preference for a single indi- 
vidual, and should this be a basis for interpersonal comparisons in the 
context of social choice? The next example is none too encouraging. 

Consider two individuals s; and s2 who have to select one of two candi- 
dates A; or Ay; candidate A; is preferred by s1, and Ag by sz. To resolve 
the strength of preference problem, these voters are also asked to rank 
ee ele candidates, As, Aa, °° * »Aioo. Voter so is very dis- 

g, and he ranks the candidates A» over Az over Ag * * * Over Agg 

Over Ajyoo over A;. On the other hand, voter 51 1s dedicated to a single 
ei he divides all candidates into two camps, namely, those who are 
or” it and those who are “against” it; he is indifferent among the “forse 
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> Recently, Luce [1956 a] has given a general formulation precleremanas i 
where strict preference is transitive and indifference is not, which is see i 
equivalent to the idea of minimum sensible utility differences. These ig 
which are called semiorders, induce a natural weak order on the set of alterna- 
tives; and, if this weak order can be represented by a numerical utility Funct 
the semiorder can be represented by that function plus two just-noticeable a 
ence functions—one giving the just-perceived increases, and the other the decr am 
Furthermore, the converse holds: if three functions over a set of alternating p 

certain weak and natural conditions, then a semiorder is induced on he 
This generalizes the non-probabilistic discrimination model used in amen 
chology to arbitrary sets, and it does not suppose at the outset that just-nouces 
differences are equal to each other. These questions arise: when, if Seal 
domain of preferences, is it reasonable to assume j. n. d.’s are equal; — 

posing they are equal for each individual, when it is reasonable to equal’ ™ 
J. n. d.’s between people? 
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priate. However, in some 
all players) are explicitly prohit 


contexts, even though coalitions might not be proh 
seek arbitration which explicitly avoids the strategic aspects of coalition 
formation. Since we have already considered at length topics in n-person 
theory which, in some sense, can be thought of as social choice procedures 
which reflect coalition strengths (e.g., Shapley’s value and Milnor’s 
reasonable sets), we now turn to the case where coalitions are either 
explicitly or implicitly ignored. 

For completeness, we will state briefly an n-person modification of 
Nash’s axioms for the two-person bargain. 


Axiom 0. Each individual's preferences for trades and lotteries over trades 


can be summarized by a linear utility indicator. 


If T is a trade in 3, let u,(T) denote player ?s utility for T and let 
u(T) be the n-tuple [u;(T), w2(T), * °° > un(T)] representing the utilities 
of T for the n players. For brevity, we write u* = (m*,u2*, > * > Ua’) 
for u(T*). The set of all n-tuples u obtained by letting T vary over its 
domain 5 will be denoted by R. Thus, a version of a bargain is the pair 
(R, u*). : 

hoa social choice function associates to each bargain (R, u*) a single point 
a = (uy’, - - - , up’), which belongs to R. The element w’ is said to be 

Solution of the version (R, u*), and the trade or lottery whose utility 
appraisal is u’ will be termed the solution of the real bargain (relative to the 
Specific choice function employed.) We shall assume for mathematical 
Convenience that there exists at least one point u = (ui, U2, °° > > Un) 
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(ii) There does not exist a wu” in R distinct from u for which uf! > u!,i=1, 
ies, ft. 
Axiom 3 (independence of irrelevant alternatives). Adding new (trad- 


ing) alternatives, with the status quo point kept fixed, shall not change an old alter- 
native from a non-solution to a solution. 
Axiom 4 (symmetry). Let a version of a bargain have the properties thal 


@)  u;* =uj*,iandj =1,2,°°° 4%, 
and 


(ii) Jfu 9 (ui, - * * , Un) belongs to R, any permutation of the components of u 
also yields an element of R, 
the Fite} . . 
n uj =u;',iandj =1,2,°°°: 14, 
where ' = (y,! . 
uw’ = (uy’, us’, - - - , un’) is the solution of (R, u*). 


Nash’s proof for n = 2 can be modified directly to yield: wis a solution 


of (R, u*) if and only if u’ belongs to R, uj! > u;* for alli ~ 
(uy’ a uy *) (ua at u*) incl (u es, *) 
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individual utility transformations, wherea 
demand— indeed, his suggested schemes establish both a comm 
origin and unit of utility measurement. We will describe Hildreth’s 
technique for accomplishing this below. Nash, like Arro\ 
and Markowitz, invokes an irrelevancy axiom; Hildreth does not demand 


(,ooaman 


this. 
To summarize, Hildreth requires: (1) individual utility scales, 
rder for society, (iii) the usual Pareto optimality, and (iv) strong 
symmetry. The strong symmetry condition consists of two parts. 

(a) Invariance with respect to labeling of individuals. This is like the Good- 
man-Markowitz symmetry requirement, and it implies Nash’s symmetry 
condition. 

(b) Similar treatment for similar individuals. 
explanation. 

Let X“) mean the social state which results if the 7th and the jth ele- 
ments of X are interchanged. In other words, i and j get in X) what 
j and i get, respectively, in X; the returns to the other individuals are kept 
fixed. Individuals i and j are said to be similar if and only if (1) 7 reacts 
to any paired comparison X versus Y the way j reacts to X“ versus ye, 
and (2) other individuals treat i and j indifferently in the sense that 
they are each indifferent between X and X) for all X. Let S be any class 
of alternatives such that (1) S contains all lotteries with elements of itself 
as prizes, and (2) if X is in S, so is XI), By similar treatment for similar 


individuals, i and j, is meant that, if X* is optimal for society for the class of 


alternatives S, i and j are both indifferent Lceween ix? and’ xr", 
Hildreth next adds an assumption which is innocuous enough as it 
stands, but, as he applies it, it is clearly the weakest link in his argument. 


He assumes: 


(ii) com- 


plete o 


This will require some 


“There exist two states, say X*, Y*, for which the following hold 
(2) x;* = Xie Vee = bP (all ‘is Ws 
(6) X*P, Y* (all i).” 


We can find no fault in saying two such states X* and Y* exist—our 
objection comes from the fact that they might be used as reference points 
for interpersonal comparisons. If that is the intent, then some logical ia | 
underpinning is sorely needed. How does one choose an X* and Y* from ee 
the set of potential candidates? The author does not say anything about 
this, but implications can be drawn from the use he makes of X* and Y*. 

Hildreth shows that the above conditions (plus two operational condi- 
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i, Choose utility scales for each individual such th. 
aia) = 1, uAY*) = 0, for 7 = 1, 2, 


where u,(X) represents 7’s utility for alternative X. 


ii. Let g be any continuous, monotonic increasing, and strictly concave 


function. Associate to each X a social utility index U,(X’) where 


U(X) = glui(X)) + gluel(X)] + ~~: + glun(X)]. 


A procedure which orders alternatives according to the magnitude of the 
social utility index U, satisfies all of Hildredth’s requirements; this estab- 
lishes consistency of the requirements. 


> Still another way to attempt to deal with group decision making is suggested by 
the following example. Three industrialists decide to form a corporation with 
themselves as board of directors. They anticipate that future disagreements will 
arise, and, to forestall wrangling, they invite a consultant to draw up a constitution 
giving the rules for resolving any potential division of opinion. The three men are 
sincere in their efforts to cooperate, and there is no question about their falsifying 
their individual values. Essentially, the constitution must provide a technique lor 


amalgamating their individual values in any situation to obtain a choice for we 
Corporation. The consultant ascertains 
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(ii) That the individuals are to b 
capital investments, etc., differ. 
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By simple algebra 


u(A) = ¢o + ¢4u,(A) 


where ¢o = wb; + webe + wabs, ci = 
The consultant now asks the industrialists as 


alternatives as best as they can. From this series of group responses, the quanti- 
ties ci, C2, €3 are estimated. These estimates are then used to get group indices 
on the basis of which group action is taken in other more complex situations. 
This method is similar to Savage’s procedure (discussed in the last chapter) for 
getting a priort probability weights for the states of nature. There a series of 


hypothetical problems were first posed, and then a priori weights were derived from 
an analysis of responses. The problem of estimating ci, co, cs; may present difh- 
culties since a group may not be consistent with the model, in which case either 
an error theory has to be introduced or the group itself can be asked which triplet 
of c’s comes closest to predicting the group responses for the alternatives they have 
considered. 

Note that no attempt has been made to recover either the individual a and 6 
parameters or the w parameters. Indeed, any attempt would be futile since they 
are clearly non-identifiable. This analysis does not enable one to make either 
interpersonal comparisons of utility or an analysis of the distribution of influence 
or importance. < 


> Let us draw another analogy, this time between social choice and individual 
choice. Consider, for example, an individual who must order a set of alterna- 
tives, or stimuli, each of which is complex in the sense that it evokes reactions with 
respect to several attributes or “psychological dimensions.”” ‘The following analy- 
sis might be plausible: each attribute orders the alternatives, and the individual 
amalgamates this profile of preferences over the relevant attributes when giving 
his individual ordering. Interpersonal utility comparisons for the social choice 
problem are replaced in this model of individual choice by interattribute utility 
Comparisons. We imagine that most people feel it is easier to make operational 
Sense of interattribute comparisons within an individual’s mind than to make sense 
of Interpersonal comparisons. Anyhow, we do. But why? Primarily because 
the Individual himself can compare his psychological reaction to one (alternative, 
attribute) pair versus another pair. In unpublished work, Abramson [1956] has 
Biven a mathematical formulation of this model. Analogously, in the social 
e olce problem, an arbiter might conceivably ask himself whether he would prefer 
ing In 55's shoes if A» were adopted or in s4’s shoes if Ag were adopted. Although 
such considerations are undoubtedly employed by professional conciliators, it 
eg 4 most unhappy way of resolving interpersonal comparisons. It imposes the 
astes of an outsider and, according to our way of thinking, oversteps the natural 
Prerogatives of an arbiter. <4 


7 MAJORITY RULE AND RESTRICTED PROFILES 


We have seen that, if very dissimilar individual orderings are permitted, 
mple Majority rule can lead to an intransitive set of social preferences. 
Is leaves open whether there are any reasonable restrictions on the pro- 


les of indiv; , : 
 aalaee orderings—reasonable in the sense that they do not rule 


Si 


Practical, non-trivial cases—such that majority rule never leads to 
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an inconsistent social 
Arrow’s impossibility 


ordering. If so, this is one 
theorem. ‘The answer 1s that 


too much versatility of a social choice function, including 
Deer 
‘ ction 1 


majority rule) do exist which satisfy conditions 6G 
One “reasonable”’ restriction on the set of possible | individual 
orderings, which has been studied independently by ( 50. 1959 
1954] and Black [1948 a, b, c], is our present topic. 

Let us, for simplicity, exclude all indifferences or ties in rankings 
throughout this section. 

In attitude measurement theory, Coombs noticed that often when a 
group of individuals rank a set of stimuli many of the potential rankings are 
conspicuous by their absence. For example, suppose that a large group of 
individuals ranking the stimuli (or alternatives) {Ai, A2, A3, Aa} yield 
only the following 7 of the 24 possible rankings (recall that indifferences 
are excluded): 


1 2 3 4 5 6 73 


A Ao Ag A3 As Aa ies 
Ag Ay A3 Ag FAs Ais Ay 
a As Ay Ay A, Ag Ao 
A, Ag Ag Ag he A ya 


An explanati i izati 

An e) on, or ibe 
ae aaa , 0 rationalization, for these 7 and not the other 17 being 
s : . . . i ‘: 

: e existence of a “unidimensional underlying continuum. 
uppose the alternatives A, on | 
j b 
continuum 


A», A3, Ag are situated on some attribute 


e.g., a li i i : ‘ 
pikGesl ( 8+ iberalism-conservatism scale when the alternatives 4" 
political candidates) as shown: 


A 1 A 2 A 3 é 
Assume each individual 
the alternatives accordi 
the above diagram, 5; w 
Az, Ag, A, Ag. It can 
and if the distance from 
seven rankings (and on] 
of the ideal points. T 


Si has ani ; f > Ale 
: Aas an ideal on this continuum and that he ranks 


sea ets “distances” from this ideal. Thus ™ 

ee the pattern of decreasing preference 
at, if A; to Ay are ordered as show 

aa A» is less than that from A3 to A4, each of the 

a agi be generated by varying the loca" 
rankings are said to be compatible wi 


if : 
» Ua profile of individual orderings is compa 


— 
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th an underlying joint quantitative scal 
timuli and individual ideals are not uniquely 1o 
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constraints on the ordering of stimuli and indivi 
relative sizes of distances between stimuli, are dictat 
data (i.e. the profile of individual rankings). 

For the time being, let us assume that we are omn iscient to the extent 
knowing the exact position of the alternatives a nd the individuals on a j¢ 
continuum. Given this underlying structure for a profile of individual 


rankings, how can one pass from it to a ranking for society? Coombs 
[1954] suggests that a reasonable ranking for society is that given by the median 
individual along the continuum (assume for convenience that there are an odd 
number of individuals so that the median is well defined). Goodman 
[1954] has pointed out that in this model the ranking given by the median 
individual yields the same ordering as that generated by simple majority rule. 
This is simple to see. If the median individual prefers A; to 4; (ise: 48 
his ideal is closer to A; than to A;), then a majority of individual ideals 
are closer to A; than to A;, so a majority of individuals prefer A; to A;. 
Asa trivial by-product of this result, we note that, if a profile is compatible 
with an underlying Coombsian joint quantitative scale, simple majority 
tule yields a transitive ordering for society. 

In practice, however, Coombs has noted that most profiles of individual 
rankings cannot be rationalized by a joint quantitative scale, so he has 
suggested a simple modification. A profile of individual rankings is said 
to be compatible with an underlying qualitative (in distinction to quantita- 
live) joint scale if there exists some overall complete ordering of both 
alternatives and individual ideals such that 


A If, in terms of the overall ordering, 5; is not between the two alterna- 
tives of a paired comparison, the alternative closer to s; in the ordering is 
preferred, 


ul. If s; is between the two alternatives, then 5; may make either choice. 


For example, suppose a profile of individual orderings contain the seven 
rank orderings given on p. 354 plus the eighth ranking: A» over A3 over A4 
ae A\. These eight rankings cannot be generated by any joint quantita- 

| a this we can see as follows. The original seven rankings require 
| 79 e€ distance between Aj and A be less than the distance between A; 
a 3 The ideal point of an individual registering the eight ranking 

ie € to the left of the midpoint between A» and Az (since Ap is pre- 

Ate one A3), and, therefore, Ay must be preferred to Ay, contrary to 

as at jon. However, these eight rankings are compatible with an 

Ying qualitative joint scale. For example, an individual whose ideal 


a 


ms _— 
Se 
————— 
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s between Az and Ag (when the _ a i ert 
register the ranking A», As, A;,As !n “ ; - ‘ OF such an 
individual is that he rank A» over Ai and A3 ov é " 
Before we introduce qualitative joint scales int ‘ice prob 
lem, let us relate them to Black's work. Black on file of ind 
vidual rankings to satisfy the single-peakedness condite ; is some basic 
ordering of the alternatives such that, in passing from alternative to 
the next in this basic ordering, each individual monotonically rises to the 
peak of his preferences and then monotonically drops off in the direction 


of dis-preference.® ‘To see that the seven rankings on p. 354 plus the 
ranking A» over Ag over A; over Ag satisfy a single-peakedness condition, 
consider the basic ordering from left to right: A1, A2, As, A4. It is routine 
to verify that each ranking is single peaked in its preferences. For 
example, an individual registering the ranking 43, A2, Ai, Aq goes up in 
his preferences as he goes from A; to Az to A3 and then drops off. Its 
not difficult to see that a profile of rankings is compatible with an underlying 
joint qualitative scale if and only if it satisfies a single-peakedness condition. 

Black [1948 a, b] has shown that, if the number of individuals is odd and 
if a profile meets the single-peakedness condition, then there is exactly one 
alternative which receives a majority over any other alternative.’ Arrow 
has extended this result to show that for profiles meeting this condition 
simple majority rule generates a transitive ordering of the alternatives: 
Therefore, if Arrow’s first condition is modified to the demand that 4 
aaa be an only for those profiles of ma 
Se ies ahi con a e-peakedness condition (or, aaa 
ee en an con 
tions 2, 3, 4 and 5 ty Gecision satisfies this and his remaining co 

>t, » Provided the number of individuals is odd. 


es gain further insight into the 


rated 

a Coombsian joint qualitative v1. ee suppose that for a profile generat 
SMa es A. Alas Itative scale a majority of individuals prefer A; to Aj * 
i. € will establish that this assumption leads to a contradiction 


thereby showi ; 

ng that intransitivit; ant 8 
: tiv aroument 

the observation that, if an lee Cannot occur. The crux of the argu” att 


oS ative of the joint scale lies between tW° There 
a aNd an both of them for any individual. Try it! 
try about: on the Joint scale either | 
(i) A; lies between 4 jand A, 
> 
or 
Gi) Ay li 
es between 4, 
7 A; and A,, 
n this section 
be tinct 
alternatives, | a ‘ 


* assumin: see a 
f this assumpt § No indifferences in individual values f° 


clusio ion is 
ms have to be made. weakened, some simple modifications ° 


fA 
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or 


(iii) A; lies between A; and Ax. 


If, as is assumed in the hypothesis we wish 1 

A, and a (not necessarily the same) majority | 

one person in common to the two majorities, and ; 

this is incompatible with case (i). Similarly, if a n 
another prefers A; to Ax, then some person must prefet 


incompatible with case (ii). Ifa majority prefers A, to A; and A; to 4;, then some 
person must prefer Ak to A; to a which is incompatible with case (iii). [his 
completes the demonstration. An actual proof of Arrow’s result is constructed 
along these lines. Gj 


14.8 STRATEGIC ASPECTS OF MAJORITY RULE 


As we have seen, no rule can satisfy all of Arrow’s demands, but some 
seem to fail in better ways than others. Many feel that simple majority 
tule is one of the best, its main failing being that it leads to intransitivities 
for some profiles. For this reason and because of its considerable social 
importance, it has been discussed at length in the literature. 

One major feature of the simple majority rule is that it satisfies the 
axiom of the independence of irrelevant alternatives. Therefore, society’s 
choice between a pair of alternatives depends solely upon the profile of 
preferences for that pair, and so any further characterization of the rule 
can be based simply on paired comparisons. Under that restriction, 


May [1952] points out that simple majority rule has the following four 
properties: 


i Decisiveness. For any profile of individual choices, it specifies a 
emque group decision for each paired comparison. 

is Anonymity. It does not depend upon the labeling of individuals. 

ul. Neutrality. It does not depend upon the labeling of the two 
alternatives, 
_ W. Positive responsiveness. If for a given profile the rule specifies that y 
snot preferred to x (i.e., xPy or xJy) and if a single individual then changes 

1S paired comparison in favor of x (i.e., changes yP;x to either yJ,;x or 
xP.y, or changes y/;x to xPy), while the remainder of the society maintain 


thei : 4 : 
€ir former choices, then the rule requires that society prefer x to y for 
the new profile, 


oY also proves the deeper result: simple majority rule is the only rule 
'sfying these four properties. The idea of the proof is straightforward. 


Suppose tha 
anonymity, 
individ 


t a specific profile of individual choices for x versus y is given. By 
the group choice will only depend upon three numbers: the number N, 
uals who favor x, the number Nz who are indifferent, and the number 
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i indi rs ae 

ho favor y By neutrality the group must be a wien 

N, who lications of positive responsiveness (starting {1 om eae 

=, favors x if Nz > N,; therefore, by neutrality ty 

oup favo or ralit it ce 

ad “A Beale majority rule. The -.. prope yed all 

‘ i i roup decision. ‘ 

since we tacitly assumed a unique group 

hese four conditions are independent in 4, 


i hat t 
May also points out that ki 7 } 
sense that rules can be devised which satisfy any th f them while not 


satisfying the fourth. Examples are: (i) no decision in case dite fe 
N, = N,); (ii) simple majority rule where one specific ind ividual has two 
votes; (iii) two-thirds majority needed for x; (iv) unanimous decision 
required. See also May [1953]. 


As we know, simple majority rule can lead to intransitive group prefer. 
ences for dissimilar individual orderings. Modifications of simple major- 
ity rule have been suggested by Copeland [1951] which avoid the difficulty 
at the expense of violating Arrow’s third condition. Let u(x) denote the 
number of alternatives which lose to x, less the number which beat x in 
simple majority rule. In one modification, alternatives are ranked 
according to thisindex. For example, the Copeland indices for the profile 


1 2 3 
x y zZ 
yy ie x 
vA x y 


are u(x) ~ 49) = u(z) = 0. Ifa new alternative w is introduced and the 
profile is changed to 


ne <= 
Ss eS 
* xv’ & WwW 


J 

then = . 

ee ee Me mae We) = — 1, u(w) = 1. Thus, Copeland's 
2 tly the independence of irrelevant alternatives, since 


the paired ¢ . 
om 
Bot, . Te ioe Ogg of x versus y depends upon whether w is present or 
BG othe: Y meets the other axioms 
mo ification x . ; 
i pute. of er : era 
intransitivities is this- Eo simple majority rule which cannot °°" 


distinct paired ieee. r each alternative x, let v(x) denote the nu” € ‘ 

which are resolved 1 eae Over all the individuals which involve * ns 

Natives are ranked x's favor, less the number resolved against *- 

5) alternatives and ee to this index. (For example, if there 
0 Individuals, there are 40 paired comparisons jnv' 


are 
ol 
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‘ > S 125 be resolve 
ing each alternative x. Shoulc 25 be resi 


and 8 be ties, then v(x) = 25 —7 = 18 

Political scientists may not feel too disturbec 
tivities of simple majority rule, since in most | 
gre asked to select a single alternative, not to 
Often a single alternative to an existing law is suggested . 
are several alternatives they are forwarded in succession, each oe 


against the current status quo situation. Some may, t 


neretore, feel tnat 
intransitivities cannot arise under present practice. This is a naive view, 
however. Let x be the existing law, and let y and z be possible replace- 


ic 


ments. Suppose the legislative body divides into three equal groups, 
which, if called upon, would register the profile: 


Group 1 Group 2 Group 3 
x y ze 
v z x 
2 % y 


Suppose y is first pitted against x and then z against the winner. This 
ultimately leads to z, since, by majority rule, y loses tox and x toz. But, 
suppose z is first pitted against x and then y against the winner. This 
ultimately leads to y since x loses toz and z toy. Thus, as is well known to 
practical politicians, the final outcome can depend upon the order of 
presentation of the bills. If a defeated bill is reintroduced and then wins, 
the interpretation often made is that some people have changed their 
minds. This may be so, but it need not be. The interpretation usually 
made ignores the different status quo’s in the two cases. Thus, given the 
usual application of the simple majority rule in legislative bodies, observers 
may be quite unaware of intransitivities even when they do exist. 

Arrow points out that it can benefit an individual legislator to misrepre- 
sent his true feelings in legislatures which vote on successive motions by 
simple majority rule. He cites the following example. 


Let individual 1 have ordering x, y, 2; individual 2, y, x, z; and individual 3, 
2,),x. Suppose that the motions come up in the order y, z, x. If all individuals 
voted according to their orderings, y would be chosen over z and then over x. 
Owever, individual 1 could vote for z the first time, insuring its victory; then, in 

: eg between Zz and x, x would win if individuals 2 and ) voted according to 

. ce ghpuan so that individual 1 would have a definite incentive to misrepre- 
‘ca _ the problem treated here is similar to, though not identical with, the 

Jority game, and the complicated analysis needed to arrive at rational solutions 


a Suggests strongly the difficulties of this more general problem of voting. 
trow, 1951 a, pp. 80-81.] 


va another example in voting strategy is instructive. Let x be the 
: Sy eR é 
Sting law, and suppose that a three-quarters majority is required to 
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Consider a legislature composed 
1. 2, and 3 all prefer alternativ 
bf b 


in an attempt to keep y fro 


replace it. 
suppose groups 


However, 
passage. : : : ae 
suggests an alternative modification 2 P} 
i WSs: 
for x, y, 2 is as follo 7 
Group 1 Group 2 Group 

ve xm y 

#y) y x 

% y Zz 
Group 4 demands a vote between z and y to deter: vhich will be 
pitted against existing law x. If the legislative rules permit this order of 
voting, z defeats y, but it fails to get the three-quarters majority necessary 


to defeat x. Alternative x remains inviolate and group 4 gets its way, 
Note: Group 4 could be perfectly sincere in suggesting z instead of y! i 

Still another game-theoretic aspect of the simple majority rule has been 
cited by Majumdar [1956]. He considers a legislature where there are 
two people (or parties) which may sponsor bills, and a majority vote 
decides which of the two bills is passed. Suppose that there are four bills 
M, N, O, and P of interest, that the transitive preference ordering for 
sponsor 1 is M over N over O over P, and that for sponsor 2 the ordering is 
Just the Opposite. When any pair is presented, it is assumed that the 
alternative which will prevail by majority rule is known. Suppose the 
Outcomes are those shown in Table 14.3q. In Table 14.35 we have 


TABLE 14.3 
Sponsor 2 Sponsor 2 
gph iH 9 P M N Ose «8 
os ee ¢ Mf(4,1) (3,2) (4,1) (14 
‘ Bliviox» 5 Sponsor N | (3, 2) (3,2) (2, 3) Sar 
PIP NO. p mee 1) (2,3) (2,3) (2) 
eee Pi) (3,2) (2,3) (1.4 


replaced the out | 
(eae indicati Reet 
the two Sponsors, F y numbers indicating the ordinal preferences 0 


i ) 
gets a simple Aaa eee if 1 sponsors N and 2 sponsors 0, then ° 
a i M Pe i ae c 

Note that, as shown 2 the Corresponding ordinal preference 1s (+ 


P to M, and ie in Table 14.3, the legisla of, 
) to O; as we know, such a ble 


Paints ~ <0. 
Ple majorit 
Majumdar - y rule. 


ture prefers O to N, 4 
ansitives are perfectly po 


the 4PPropriate strategy for a sponsor in this 
bidder, and, if, unhappily, one is to mov’ a. 
t his true tastes, We may utilize our know 
© Comment on these observations. 
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First, since the sponsors have 
alternatives, and hence the outco 
what we have seen earlier, it is tl 
player to disclose his strategy. Itis1 
ew (to a smart adversary) that one 
Since this game fails to have a saddle poin 
of the maximin strategies must be randomize 
that 1 should randomize between M and N and 2 betwe 
players are to move simultaneously. °® 

Ifthe sponsor’s preferences are not strictly opposing, then the game is not 
constant-sum and so having the first move is not necessaril) disadvanta- 
geous. Of course, problems of preplay communication, arbitration, etc., 
arise. 

At first glance, simple majority rule appears to abstract away all indi- 
vidual intensities of preference. Dahl [1956, p. 90] writes: 

What if the minority prefers its alternative much more passionately than the 


majority prefers a contrary alternative? Does the majority principle still make 
sense? 


This is the problem of intensity. And, as one can readily see, intensity is almost 
a modern psychological version of natural rights. For, much as Madison believed 
that government should be constructed so as to prevent majorities from invading 
the natural rights of minorities, so a modern Madison might argue that govern- 
ment should be designed to inhibit a relatively apathetic majority from cramming 
its policy down the throats of a relatively intense minority. 

Yet we can argue that in some measure intensity of preference does 
receive expression in actual practice through *‘logrolling.”’ A senator who 
feels strongly about bill g and indifferent about bills r and s will trade 

is votes on r and s for the desired votes of senators indifferent about 4. 
Thus, his strong preference for bill q is recorded, and it may be passed 
even though according to the true tastes of the senators it should have been 
defeated. Is this good? That depends upon the bill g—at times we 
grumble ““Shameful!”? and at others chuckle “Beautiful strategy!” 

The dynamic strategic aspects of legislative voting generate lovely 
examples of n-person games in extensive form. If bill g would have been 
Passed without the sale of votes on r and s, then our senator has wasted 
assets that could have been put to better use. 

Filibustering and its threatened use is, of course, another method by 
Which an intense minority can exert pressure on the majority. A signifi- 
Cant difference between filibustering and logrolling, however, is that an 
intense minority can defeat an intense majority with filibustering but not 
With logrolling. 


8 
For player 1, row P is dominated by row N, so it can be deleted. Similarly, M and 
are dominated for player 2. Once M is deleted for 2, O is dominated for 1. 


} 
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There are alternative legislative voting mn ce ia 
easier for the legislators to express their intensity a Conside: 
the following rule: A group of bills concerned wit €8 are al 
debated and then simultaneously voted. Each in diode 


to be distributed over the bills any way he \ Oting for 


units § : 
ts of weights over a} is by secret 


each bill and the apportionmen 
ballot. The game-theoretic aspects are manifest. | ; appalling 
to contemplate the ensuing havoc and recrimination: 


Another important voting concept in democratic theory, which has 
received some mathematical treatment, is proportional re yresentation, 
March and Levitan [1955] specify some “reasonable”? conditions on a 


“political representation function,” which they show imply that the per- 
centage of votes won in an election by a party must be transformed into 
an equivalent percentage of seats won in a legislature. 

Let there be m parties, and let each of x individuals vote for a single 
party. Implicit in their work is the democratic assumption that all indi- 
vidual voters shall be treated equally, so any set of individual choices 


can be summarized by an m-tuple (x1, 2, ° °° > Xm), where x; represents 
the proportion of votes cast for the ith party. A political representative func- 
tion is a rule for assigning to each such (x1, %2, °° * ; Xm) an m-tuple (y1, 


Pea ym), where y; indicates the proportion of seats won in the 
legislature by the ith party.” We shall idealize the model by not requiring 
: 30 the total number of seats be an integer. Although this pro- 
e is hardly to be recommended in practice, academically a non- 
integral number of seats can be interpreted to mean that a radomization 
scheme will be used. March and Levitan require: 
1. Equal treatment of parties. (That is, the procedure does not depend 
ie es labeling of the parties.) 
ee ie oon sel depends only upon party voting strength. as 
requires that part nana independence of irrelevant alternative 
tribution of the a Presentation be independent of the irrelevant dis 
Oe ee a a vote over the other parties.) 
seats (analogous to hea team to add a condition of No wits" 
not mean fewer se - 
hele em to Arrow’s condition of positive associate 
condition 1 or conditi <A this need only be done if we weake? poe 
» for both are implied by our two requireme™™ 


. er 0 
of non-imposition), or More voles * 


lon 2 


7 . 
Mathematically, the word *“ i 
Proportion” here implies 


™m 


yi = 1 


t=1 


. 
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shen there are more than two parties. 
W fs , 
two parties, conditions 1 and 2 imply 

Re Xa) to the vote (%1, %2, 
sented strictly according to the percentage 
principle of proportional representation. 


The idea of the proof is as follows: From condition 
sentation y; of the ith party depends only upon the proj 
receives), it follows that y; is some function of xj, i.€., } 


the function f; does not depend upon 7, so we can write y = f(x), where y 1s te 
proportion of representation of any party that receives the proportion x of the 


yote. If u and v are any two numbers between 0 and 1 such thatu +vS 
follows that 


@) f@) +f +f —u-—2) =1, 
and 

Gi) fu+2) +f —u—r) =1. 
Subtracting (i) from (ii) yields 

(iii) fu + x) = fu) + fe). 


Let n be the total number of people voting; then both wu and v must be of the form 
i/n, where i is one of the integers 0, 1,2, ---,n. Ifwe choose u = v9 = 0, then 
(iii) reads f(0) = f(0) + f(0), and so f(0) = 0. Similarly, (ii) reads f(0) + 
f(1) =1, and so f(1) =1. If we now choose u = v= 1/n, then (iii) reads 
f(2/n) = 2f(1/n). If, next, we let u = 2/n, then (iii) reads f(3/n) = 3f(1/n). 
Continuing in this manner, we see that f(?/n) = f(1 /n); for P= 0,1, 2 < + ye 
But, since 1 = f(1) = f(n/n) = nf(1/n), we conclude that f(i/n) = 1/n and 


{(i/n) = i/n, as was to be shown. 
The number of parties, m, had to be greater than 2 to justify step (i) above, 
which required three parties getting proportions u, v, and 1 — u — v respectively. < 


Condition 2, which demands that party representation be independent 
of the way the remaining vote is distributed among the other parties, is 
extremely strong, and it has been and will be heatedly debated for a long 
time to come. Since condition 1 is quite innocuous in most contexts, 
debate over condition 2 boils down to a debate over P.R. itself—which 
inescapably involves such dynamic game-theoretic aspects as coalition 
rmation and the stability of coalitions in the legislature itself. We could 


i be tempted to go off in this direction if only we had something new to 
add, 


1 
49 GAMES OF FAIR DIVISION 


Al ‘ . my 26 
es | the methods so far described to amalgamate individual preferences 
* : : : eRe 
4 social preference have this one element in common: it is supposed 
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that explicit rankings of the several — ‘ m for Cach of 
the individuals. If these ground rules are C .. sae © Procedure, 
must be used. For example, a game Bey be devise lve the confi 
without necessitating that each individual presen ule of prefer, 
ences. The rules will be so concocted that eee p) act “rationally” 
in their own selfish interests the outcome is “socially desirable” or “fain 
to all the participants. On the macroscopic level it is olten alleged that 
economic markets fulfill such a role; it is clear, however, that this need not 
always be the case. The economic analogue of the prisoner’s dilemma (see 


section 5.4) appears not to result in a socially desirable outcome. In this 
section we shall be concerned with games explicitly designed to lead to fair 
outcomes. 

First, let us reconsider and modify the two-person bargain from this 
point of view (see Chapter 6). Recall that a bargain is characterized bya 
set 3 of feasible trades or reapportionments of the collective sum of goods 
and by a special distinguished trade 7* representing the status quo point 
(i.e., the division which gives each player the exact bundle of goods with 
which he started). We shall modify this by assuming that the players 
as a group are given a set of items to be divided among them, and not that 
they bring their own goods to be bartered. ‘The set of items might be any 
of the following: $100, a pie, a painting, a complex bundle of goods, several 
pieces of real estate, a set of obligations which they must jointly perform 
etc. Itcan be argued that in all these cases there is also a status quo point 
pee. Oe) i , ies before they were given anything to ame 
ee of “fair division's pean cn 
part he prefers; the as oe. geo parts and for the oy = * oe * the 
game by Senin af ad of the two roles can be eliminated * ae 
sense is this a “fair” os. . ee a : ho cm t it doe 
Set Salen: the lab ure: First of all, it is egalitarian in HE 

: abeling of the individuals; and, second, there 
presumption that the resulti i — exal” 
ng outcome will be Pareto optimal. il : ihe 


ple, if the commodi ae . 
divider is an even zi to be divided is $100, the obvious strates? ‘got 


divisible ak al ee the same applies to a pie or any oa ot 
! ie € 
y ctually, even in these oversimplified cas side! 
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cannot al 

Oe al people to follow this simple procedure: Rm ric 

man may prefer t a pauper who together come upon $10" ee as 

huis a eet a larger share to the pauper, an there! the 

pauper, in an a: € a 60-40 split. Even with their roles ee als? 

elect the 60-40 ae appeal to the conscience of the rich ma” a 
ie r than the 50-50 split. é 


i eee aa p 
problem, for J Rinaba is indivisible, such as a painting, presente is 0 
vider has no real choice. The procedur ere 
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nothing but flipping a fair coin to determine the lucky player. This 
resolution, although egalitarian and Pareto optimal for the problem as 
described, may fail on the score of Pareto optimality if the scope of action 
for the players is enlarged to include side compensation. We have in mind 
this sort of modification: Each player adds x dollars to the pot, where x 1s 
an amount clearly in excess of the worth of the indivisible commodity—a 

inting, 


say—and then the usual fair division game is played with the 

ox dollars plus the painting. The divider splits the pot into two parts, 
(1) the painting plus y dollars, and (2) 2x — y dollars. The chooser 
selects the part he prefers more. Hence, the non-trivial strategic element 
of this game is the divider’s selection of y. To be specific, suppose the coin 
selects 1 as divider and 2 as chooser. Furthermore, suppose 2 prefers the 
painting plus y dollars if and only if y > ye. From 1’s point of view, the 
unknown quantity y2 can be thought of as the true state of nature in his 
decision problem under uncertainty, and his optimal choice naturally 
depends upon specifying his a priori knowledge of the true state. Suppose 
1 feels indifferent between the painting plus y; dollars and 2x — y1 dollars. 
_ Then his maximin strategy is to divide the pot into these two indifferent 
parts. If, as dividers, both players are committed to their maximin 
es, clearly it is advantageous to be the chooser. On the other 
if each person has precise knowledge of the other’s indifference point, 
it is advantageous to be the divider. Since this is not entirely obvious, 
s look at it in more detail. Suppose yi < yo. As divider, 1 can get 
less than the painting plus y» dollars; as chooser, 1 can only expect 
more than the painting plus y; dollars. So the advantage to 1 in 
divider is roughly y2 — yi dollars, provided, of course, each 
are of the true tastes of the other. A more striking and well- 
of “divider advantage” arises when the divider is indifferent 
but knows that the chooser is sentimentally attached to 
of weakness—at least in a society of business men— 
1 the obvious way, so that when 2 chooses the painting 
the money for 1. On the other hand, if 2 is the 
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s [1948] reports a 8 “divide 

oe. et due to B. Knaster and S. Banach. Ty 

eich pertains to an infinitely divisible homogeneous co 
Ww 
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a cake, 1s as follows: 
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The partners being ranged 1, Ze os.: , e ne ake an arbitrary 
art. 2 has now the right, but 1s not oblige , to di ee hp € cut off, What. 
Sir he does, 3 has the right (without obligation) to dirninish still the already 
diminished (or not diminished) slice, and so on up to. Thr rule obliges rs 
“Jast diminisher” to take as his part the slice he was the last to touch. This nie 


ner thus disposed of, the remaining n — 1 persons start the same game with the 
remainder of the cake. After the number of participants has been reduced to two, 
they apply the classical rule fone divides while the other chooses] for halving the 


remainder. 


Knaster also suggests (cf. Steinhaus [1948]) a method of division appl. 
cable to the case where the bundle of goods to be distributed contains 
fairly indivisible objects. To be concrete, suppose a father leaves his 
single estate of four indivisible commodities to be shared “‘ equally” among 
his three children. Let the four commodities A, B, C, and D have the 
monetary values shown in Table 14.4. For the time being, assume that 
the monetary worth to each of the children of any subset of the items is 
merely the sum of his monetary evaluations of the individual items. 


Table 14.4 
Individuals 
Items 1 7 3 
A $10,000 $4,000 $7,000 
zB 2.000 1,000 4,000 
500 1,500 2,000 
800 2,000 1,000 
Total valuation 
1 13,300 8,500 1A 
ee 4,425 2,833.33 oe 
ommodities received v4 z. .. Band 
Monetary worth of commodities - 
- eed 10,000 2,000 6) 33 
Final division P2575 Eee wer 61 


50) D+ 2958.33 B,C + 
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to 2, so with it given to him he is = short (negative CECE) Nee | Be 
A similar analysis shows that 1 and 3 have a total excess of $5 975 7 
§1333.33 = $6908.33. When 2 is paid his deficit, there remains 2 tota! 
excess of $6908.33 — $833.33 = $6075. Each 1 layer’s share of this total 
excess iS 1g X $6075 = $2025. Hence the final division should give each 
player $2025 in excess of his “fair share.” It is easy to verify tha ; im 
general the final division will give each player at least as much as his fair 
share. 

This procedure can be generalized to unequal shares. For example, 
suppose the will had stipulated 16 share to 1, 3g to 2 and 1g to3. The 
fair shares are then 14 X $13,300, 3g. $8500, and 14 X $14,000, 
instead of $4425, $2833.33, and $4666.67, respectively. From here the 
analysis proceeds in a similar manner. It is also not difficult to suggest a 
modified procedure for situations when the monetary worth of a set of 
items is not the sum of the monetary worths of the items in the set. 

Once such conciliation machinery has been established, players may 
find it profitable to misrepresent their true tastes and to enter into coali- 
tions. For example, suppose 1 knew 2 and 3’s recorded evaluations. It 
would then be to his advantage to value A at $7001, B at $3999, C at 
$1999, and D at $1999. Player 1’s fair share is then 14(7001 + 3999 + 
1999 + 1999) = $4999, and possession of A yields only an excess of 
$2002. Thus, 1’s final division is A — $1135 instead of A — $3550. If1 
does not know 2 and 3’s valuations, it can be dangerous for him to mis- 
represent his tastes too grossly. On the other hand, a collusion of 
two players and collective misrepresentation of both their tastes is less 
dangerous. 

The “divide and choose” principle yields an alternate way for sharing 
the estate {A, B, C, D} which does not necessitate a prior recording of 
Individual evaluations. To surmount the non-divisibility feature, let each 
Player add $10,000 to the pot giving a total commodity bundle of { A, B, 
C, D, $30,000}, and then apply the n-person variant of the “divide and 
choose” principle to this set of goods. Again, the relative advantage or 
disadvantage in being the initial divider depends upon one’s a priori 
owledge of the true tastes of one’s adversaries. A random selection of 

€ order for the players eliminates this asymmetry. 
eo (welfare) function which dictates precisely how to pass 
i Q ual va ues to social preferences is too cumbersome and imprac- 
ae € employed in many contexts. Often, an automatic adjustment 

pee SS 1s needed which modifies the social choice slightly without necessi- 
rae oa Intricate re-evaluation each time there is a slight change in 

"Yidual values. It is extremely difficult for a thoroughly planned 
a which attempts to be egalitarian, to be flexible enough to cope 
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d do exercise indirect controls on 
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planners can an ~ COME of 5 


game by changing its rules and by altering e. MEETS Of the 
system. The feasibility of introducing games ir : es C0 resolye 
conflicts of interest in non-economic contexts is an } iguing area fy, 
research. Ideally, such a game should yield a unique equilibrium Point 
(assigning a “fair share” to each player) which is Pareto optimal It 


would be nice if, in addition, the players would each act in accord with 
their true tastes when at the equilibrium point. As long as we are dream. 
ing we might as well throw in a demand for a dynamic structure to the 
game such that even moderately intelligent mortals will be inexorably 
forced from non-equilibrium points toward equilibrium during repeated 
plays of the game. 


14.10 sUMMARY 


The social welfare problem, as Arrow formulates it, is: Given the 
preference rankings (ties allowed) of m alternatives by the members of a 
society of n individuals, define “fair”? methods for aggregating this set of 
individual rankings into a single ranking for the society. Such a rule for 
transforming an n-tuple of rankings—one ranking for each individual—into 
a ranking for the society is called a social welfare function. Arrow has 


shown that five seemingly innocuous requirements of “fairness” for social 
welfare functions are inconsist 


satisfies all of them), 
function has to resoly. 
(2) positive association 


ent (i.e., no welfare function exists which 
The five conditions are: (1) universal domain (the 
€ all conceivable profiles of preference patterns); 
of individual values; (3) independence of irreleva"" 
n’s sovereignty (or non-imposition); and (5) 2” 
Cussed the meaning and motivation of each of the* 
hed out the nature of Arrow’s impossibility P!°" 


a 


14.10] 

‘nto the data of the problem, then we hav 
ersonal comparison problem in the sens 
mensurable unit and/or base of referenc« 


discussed: 


;, Goodman and Markowitz employ a common unit 
thought of as a variation of either the just-noticeable-difference n« en 
used by sensory psychologists or the minimum sensibles of Edgeworth 
Their primary result is related to the criterion for decision making under 
uncertainty based on the principle of insufficient reason. 

ii. Nash’s work on the bargaining problem was generalized and inter- 
preted as a possible resolution to the social choice problem. Although 
this procedure does not introduce commensurable units, a base of reference 
(status quo point) is required. The Chapter 6 discussion of the Nash 
bargaining problem translates with only minor modifications into the 
social welfare context. 

ili. Hildreth also introduced strengths of preferences via utility assign- 
ments, and he established both a common unit and a base of reference by 
positing two special social states such that for each state the individuals 
receive the same goods and services and for which their preferences can be 
said to be equal. 

iv. In terms of an example, we outlined a method which takes into 
account both strengths of preference and the asymmetries of the roles of 
the members of the group whereby a group might combine the differing 
individual values to arrive at a group choice. In essence, the group’s 
manifest behavior in resolving some specific problems is used to estimate 

| certain parameters in a hypothesized model, and in turn these estimates 
) are used, via the model, to reconcile other cases of group conflict. 


One might have expected, a priori, that simple majority rule would 
Satisfy Arrow’s conditions. Indeed it does except when the individual 
tankings are very dissimilar, in which case it gives rise to intransitivities. 
It is natural, therefore, to search for reasonable restrictions to be placed on 
the profiles of individual rankings such that majority rule always leads to 
4 Consistent social ordering. ‘The concept of a profile which is compatible 
with an underlying Coombsian joint quantitative scale was introduced, and 
ue showed that for such profiles the median individual’s ranking on the 
Scale is the same as that induced by simple majority rule. Black’s single- 
Peakedness Condition is equivalent to the existence of an underlying 
| Slane joint qualitative scale, and Arrow has shown that, if these 
simple tee assumptions are met and if the number of individuals is odd, 

ajority rule can never yield a non-transitive social ordering. 
€ following three topics which relate to various aspects of simple 
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majority rule were discussed: (1) May's a; ‘ ‘ eCeSsary ang 
sufficient conditions for simple majority rute. al ternative 
employing simple majority eule for a . : , of profiles, ty 
domain may be left unrestricted and the rule so m that it alway 
leads to a transitive social ordering. the variations mentioned violate 
Arrow’s axiom of the independence of irrelevant alte: natives. (3) Game. 
theoretic strategical aspects arise when simple majority rule is employe 
for an unrestricted domain of profiles. ‘This was illustrated by examples 
of the difference made when bills are presented to a legislature in differen: 


orders. 
In the final section we reversed our tack. Instead of suggesting differ. 


ent planned programs for passing from individual values to social prefer. 
ences or investigating the game aspects of such plans, artificial games were 
concocted so as to have the property that when the players act in their 
own selfish interests the outcome is “‘fair’’ in the eyes of the planner. The 
use of games of fair division to resolve social conflicts has the distinct advan- 
tage that prior, detailed individual preference information is not needed. 
This plus added flexibility allows for more decentralized planning. 


appendix | 


A PROBABILISTIC THEORY 
OF UTILITY 


A1.1 INTRODUCTION 


Utility theory as formulated by von Neumann and Morgenstern (see 
Chapter 2) assumes, among other things: 

1. That, given two alternatives, a person either prefers one to the other 
or is indifferent between them. 

2. That there are certain well-defined chance events having probabili- 
ties attached to them which are manipulated according to the rules of the 
Probability calculus. 

In criticizing that theory, we emphasized that some experimental data 
Suggest that the latter assumption is in error; and, although we did not 
Particularly question the first assumption, we did stress the difficulty of 
obtaining transitive preference reports. ‘These two may not be unrelated, 
for, if we replace assumption 1 by the assumption that a person has a 
©ertain probability of expressing a preference for one alternative over 
another, then a single choice from each pair of alternatives cannot gen- 
rally result in transitive patterns. ‘Thus, it may really be assumption 1, 
Not 'ransitivity as such, which is the source of some difficulties in utility 
theory, ; 

We know of no direct empirical method to decide whether assumption 1 
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a? COA Probabilistic Theor ™ 
or the assumption of probabilistic preferences 1s aod 
a person expresses his preference between two alt wh . 
cannot distinguish between them. But, if we ask | 5 his Mis 
ence several times for a given pair of alternatives 1} eae 
and these, so long as they cannot be ruled out, see IMposgi} 
to decide between the assumptions. First, the ver 1g i ae Zz 
may change the situation so that the person’s secon taeda ire 
under the same conditions as his first one. If so, the preference 
could be different without invalidating assumption -qually, ier 
if the choices are prefectly consistent, we cannot conclude that assump. 
tion 1 is necessarily supported. If the choice expressed is remembered and 
if consistency is an overriding virtue for the person, then the chance of his 


making the same choice will be sharply increased from what it was origi. 
nally. That is, one effect of memory may be to alter the probabilistic 
structure. 

; So far as we can see, one is forced to select between these two assump- 
tions in terms of the overall adequacy and predictive power of the theo- 
retical structures which are possible in each case, not in terms of direct 
eee evidence. Our goal here is not to reach and defend a 

oice i 
a ee them but to show one possible structure generated by the 
Soaibilities Moat preferences are probabilistic. Nonetheless, the a prior 
. €s just mentioned raise basic empirical difficulties for both models. 
the very act of making a choice ] ituation, it is diff 
can alter the total situation, it is difficult 
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how convenient it may be—money sh« 
to the exclusion ol other commodities. 
Let us turn next to the second assu 
yon Neumann-Morgenstern theory 
there is a fair amount of evidence to 
jorally innocent of the calculus of pro! 
vations suggests that they are not consistent 


two chance events is more probable. f 
depend upon what are commonly called chance events, but for which one 


is very hard pressed to assign objective probabilities. Each spring a 
farmer must estimate the chance of another frost; from time to time most 
of us must decide about the risk of another plane trip; an investor must 
consider the likelihood of the market falling or not; and so on. It is 
difficult to see how to attach objective probabilities to these events in the 
certain way one does to a carefully manufactured and tested die. There 
are complicated cyclical fluctuations in the weather which are not ade- 
quately summarized in the statistics available to a farmer; it is questionable 
whether one plane trip compares to another in the way one flip of a coin 
does to another; etc. Yet subjectively we each assign some sort of fuzzy 
“probabilities” to such events, at least to the extent of feeling that one is 
more or less probable than another. ‘The fuzziness is suggested by our 
inconsistencies when we are forced to make the comparison several times, 
especially when we do not realize we are making the same comparison or 
when we have a lapse of memory as to our previous choice. So the second 
change we propose in utility theory is to admit that we shall be dealing 
with fuzzy subjective probabilities, not sharp objective ones. 

In sum, utility theory will be modified by assuming that people can 
neither discriminate perfectly between alternatives with respect to prefer- 
ences nor between events with respect to likelihood. This is nota question 
of psychophysical, i.e., stimulus, discrimination: we shall suppose that there 
is not the slightest difficulty in telling one alternative from another, or one 
Saeed from another, as physical stimuli. The assumed trouble is in sepa- 
oe alternatives consistently as to preference and events as to likelihood. 
ce 2 ta hat we wil oot give ¢ et of exioas 
that ensures the existence re utility Seca, eae a. " re 
that both a utility function and bjecti b bit ae 
and satisfy e “4 ce ; au jective probabil ity function exist 
eis es a ae ye le the expected utility hypothesis, 
engi oa a ‘ = iscriminations people make Satisfy 

ie otto mt ae an eur enquire into the implications of 
ters 13 and 14 wa oe our results is more like some of those in Chap- 

apter 2 in that it is an impossibility theorem 
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As in Chapter 2, suppose that a set A of p Pus 


sideration by the individual and that from these a set of gambles is devel. 
oped using chance events taken from a set (actually, a Boolean algebra) R 
Be ans If a and } are any two alternatives, or gambles, and a is an 
ee in E then the symbol aab will be used to denote the gamble in which 
zi . . / , 
ais the outcome if the event a occurs and 4 if it doesnot. (In Chapter 2,2 
somewhat different notation was used. Assuming event a@ has objective 
probability p, we denoted the gamble-by [pa, (1 — £)d], so the analogous 
i ab], where & denotes “‘not a.” It seems, 
notation for events would be [aa, ab], wher ! 
however, more convenient here to use the slightly more compact oe 
aab.) The set of all such gambles, including the pure alternatives ot 4, 
that can be so generated will be denoted by G. 


Axiom 1. For every a in G, aaa = a. 


rs 

In words, the gamble in which a is the outcome whether or not a ant 

is not distinguished as different from a itself. It is hard to quarrel wi 
: . . . ar ts : Vi 
this, although when combined with axiom 9 it implies that the subject! 


probabilities of an event and of its complement sum to 1, which Edwards 
[1954 c] has questioned. 
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objective probability 
As we indicated earlie 
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introduced in part to avoid th 
Neumann 


gambles from G, we suppose that there exis‘ “4 
P(a, b) that the given individual will prefer 4 on 
r, it is not easy to see how to estimate such prob +h, 
€ need not concern ourselves about that when des" 


as beth 


Lea 
perfect preference discrimination ae 


€ strong transitivity requirements of t 


apur 

and Morgenstern theory, it would be folly to ignore ane a 
i nce suggesting that Preferences are approximately aii 
1S €asy to go astray at this po 


: + tog gin08 
int by assuming certain inequalities “ 
the th is y ng I ay 
ae ee P(a, 6), P(d, c), and P(a, c); apparently this ee 
gs ie - Our tack is a bit different. Observe that, in a?! 
? Preferred or indifferent to” 6 if Bee ae cin G both 
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when Ay 
to see that 7 must always be transitive, but that in gene ral | 


ever these two sets of inequalities hold we shall wi 
easy | 
alternatives which are not comparable according to » 
tion we shall make about preference discrimination 
parisons are always possible, i.e., 


Axiom 2. For every a and b in G, either a > b or b > 


This a strong assumption, but we do not believe it to be nearly so 


vy so strong 
as the corresponding ones in Chapter 2. There, comparability was 
operationally forced by the demand that the individual make a choice 
but transitivity was in doubt. Here, transitivity is certain a nd compara- 
bility is in doubt. Although it is plausible that axiom 2 is met in some 
empirical contexts, the following example strongly suggests that this is not 
always the case. Suppose that a and b are two alternatives of roughly 
comparable value to some person, €.g., trips from New York City to Paris 
and to Rome. Let c be alternative a plus $20 and d be alternative } plus 
$20. Clearly, in general 


P(a, c) = 0 and P(b, d) = 0. 
It also seems perfectly plausible that for some people 
Ptb,.c),<>0 and Plas dase 0; 


in which event a and } are not comparable, and so axiom 2 is violated. In 
one respect this example is special: ¢ differs from a and d from 6 by the 
addition of an extra commodity which is always desirable; therefore, we 
may expect perfect discrimination within each of these two pairs. As we 
shall see, there are theoretical reasons for believing that the occurrence 
of perfect preference discrimination may require a somewhat different 
model from when it never occurs. 

Let us say that a and ¢ are indifferent in the induced sense, and write 
@.~ b, whenever both a 7 b and b > a. Wenext argue that certain two- 
Stage gambles should be indifferent. 

Consider the gamble (aab)Bc, where a, b, and ¢ are pure alternatives. 
If one analyzes what this means, one sees that outcome a results if both 
eand 8 occur, i.e., if the event a(\6 occurs; b results if both &@ and @ occur, 
L€., if 8 occurs; and c results if 6 occurs. A similar analysis of the 
Se a(a(\8) (bBc) shows that 4, b, and ¢ occur under exactly the same 
e Bee Thus, there is no difference between the two gambles, and 

s reasonable to argue that a person should be indifferent between 


ae We shall demand that this hold not strictly but only in the weaker 
“nse of induced preference. 
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A1.3 LIKELIHOOD DISCRIMINATION AND 
QUALITATIVE PROBABILITY 


: ide between the two gambles aa} 
Suppose that tod ect asking himself which alternative, 
and a8b. He can simplily his c siders more likely to 
bh. he prefers, and which event, a or #, he considers 
a Of the Rue combinations, two should lead to preference for 
a ie d to b, and a is deemed more likely to occur ae if 
2. b is preferred to a, and 8 is deemed more likely to occur ” 
ge een Hie probability that he will prefer « to 4 is P(a abel 
suppose that his discrimination as to the likelihood of eyenia ts ; moe 
independent of his preference discriminations, and that It Is ae en 
probability Q(a, 8), then the probability that he will both pl “ ea 
and deem @ more likely to occur than 8 is P(a, b) Q(a, B). aw i a a‘ 
probability that he will both prefer 6 to a and deem 8 more likely an 
than a is P(b, 2)Q(8, a). Since these two cases are exclusive of et ae 
the sum of the two numbers should give the probability that he wil! P 
aab to aBb. 9 dis 
The important assumption made in this argument is that the tig. 
crimination processes are Statistically independent. This seems ae 
able when and only when the subject believes the two gambles 4 se ~ 
be “independent” of the events a and 8, for, if alternative a depe” a i 
and he believes a is likely to occur, then he is really forced to comp oe f- 
outcome of a which arises when a occurs with a8, in which case his p? 


: were 
ence between aab and afb may be different from what it would be if4 ie 
independent of Qa. 


, t 
‘ There is at least one case when it is plausible oe end 
subject should deem a and } to be independent of a and 8, namely, ¥" 
ernatives having nothing to do with chance even'* 
Conclusion holds in that case. ‘al if 
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tions actually are statistically indepe: 


separ 
and it seems reasonable that people attempt 


ate preferences among alternative 


independent dimensions. On the other 
cates that people do play long shots, and such | 
the axiom. At the least, the axiom seems sufficiently compellin 


dictum of sensible behavior to warrant its investigation, and it can be 


jooked on as a generalization of related, but non-proba! ilistic, assump- 
tions found in other work, e.g., in Ramsey 
(see the second postulate in section 13.5). 

Our next axiom is comparatively innocent. Let us state it first and 
then discuss its import. 


PGC 1 es ee ele T4 QA) 
1931] and in Savage [1954] 


Axiom 5. For every a and b in G, 
P(a, 6) 2 0 and P(a, 6) + P(d, a) = 1. 


For every a and B in E, 


Q(a,8)20 and Q(a, B) + Q(B, a) = 1. 
There exist at least two alternatives a* and b* in A such that P(a*, b*) > Y. 


First, we have supposed that the P’s and Q’s are actually probabilities in 
the sense that they lie between 0 and 1 inclusive and we have supposed 
that the subject is forced to make choices between alternatives and 
between events. That is, he cannot report that he is indifferent between 
aandb. Experimentally, this is known as the ‘‘forced-choice”’ technique, 
and it isin standard use. It may be worth mentioning that, if one allows 
indifference reports in the sense of only demanding P(a, b) + P(b, a) < 1, 
then the mathematics leads to two quite distinct cases—the one we shall 
describe here and another one somewhat like it but apparently less real- 
istic, The final condition simply demands that the situation be non- 
trivial in the sense that not all pure alternatives are equally confused with 
respect to preference. 


From axioms 4 and 5, it is trivial to show that 


ea) Pa) — 1 
oPla by — 1 
fi : 
ery a and b in A such that P(a, b) # 1% [by axiom 5, at least one such 
t (a*, b*) exists]. This expression is useful because it permits one both 


to . 
a Re imine whether a given set of preference data do satisfy the inde- 
€nce assumption and, if they do, to estimate Q(a, 8). 


Q(a, 8) = 


5 eee ee i 
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In complete analogy to “induced — m5 ne wg: 
on the set of events E. We write a 7 B if 
Ola, 8) 2 Q(8,8) and Qs, 2 

for every 6 in E. We shall refer to this as the * Probability» 
(induced by Q) on E. One might expect us now t a ereigtaaaan 
ity axiom like axiom 2 on qualitative probability is is unnecessary 
as it is a consequence of our other axioms. Rather, an entirely differen; 
assumption, peculiar to the notion of probability, is required. We shal 
suppose that the subject is certain that the universal event ¢ of the Boolean 


algebra E will occur. For the moment, we will demand that no even 
have a qualitative probability in excess of ¢ or less than its complement, 


Axiom 6, [f ¢ is the universal event in E, then e a a “dl @ for every ain E 


A1.4 THE UTILITY AND SUBJECTIVE PROBABILITY FUNCTIONS 


So far, our technique of study has been similar to that exhibited in 
Chapter 2, but now we depart from that tradition by assuming that utility 
and subjective probability! functions exist having, among others, proper 
ties like those established in Chapter 2. Of course, neither of these two 
functions, however we choose them, can be a complete representation of 
the assumed data in the same sense that the utility functions of Chapter 2 
were. We no longer have a simple transitive relation to be represented 
numerically but rather a set of probabilities. The role of what we shall 
continue to call the utility and subjective probability functions will bea 


Partial and—as we shall see—comparatively simple representation of the 


So Aeon It is analogous to using a statistic such as the mean 
or standard deviation to gi ; a bability 
istri ° Ive a artial of a pro a ) 
distribution. 8 partial description p 


W . i 
G ie . suppose that there exists at least one real-valued function “ . 
€ utility function and at least one real-valued function ¢ 


called the subject; aX | a 
are met. smojective probability function and that the following axiom 


Axiom 7, x p the 
err reserves the induced pr ‘ reserves 
qualitative prob ability on E. i preference relation on G, and $f 
e, 


u(a) > u b i yy 2 
a) > : R 
sit o(a) 2 $(8) if and only if a > B, for a and B in E. ’ 
€ meaning of subject; bes is ® 
exactly the ee (ic acate Probability here will be self-contained ane sa! 


eee iy as discussed j : 
Similarities, sed in Chapter 13. There are, however, certain im 


— °°» 


A.A] The Utility and Subjective Probability Fy 


As this sort of condition is already very fami 
on it. 


Axiom 8. o(e) = 1 and $(2) = 0 


This prescribes more clearly the role of the universal eve1 
event which is subjectively certain to occur, and its complement is SU 
jectively certain not to occur. 

Given a subjective probability function ¢, we may follow the usual 
terminology for objective probabilities and say that two events a and # are 
(subjectively) independent if and only if d(a(\8) = $(a)¢(8). It is clear 
that we cannot ascertain which events are independent until we know the 
subjective probability function ¢, and so it would appear as though we 
were rapidly getting ourselves into a circle. However, it turns out that all 
of our final conclusions can be stated without reference to independent 
events provided only that axiom 4 can be extended in a certain way and 
that there are enough independent events—so many that no exhaustive 
check would be possible anyhow. These conditions will be formulated as 
axioms 9 and 10. 

Earlier, when we introduced axiom 4, describing the statistical inde- 
pendence of the two discrimination processes, we held that it should be 
met whenever the two gambles a and 5 are “‘independent” of the events 
a and 8, without, however, specifying what we might mean by this except 


that it should hold for all pure alternatives. We now extend axiom 4 as 
follows: 


Axiom 9. If a and b are in A and and B are events which are subjectively 
independent of event y, then 


Pl(ayb)ab, (ayb)Bb) = P(ayb, b)Q(a, B) + P(b, ayb) Q(B, a). 


Axiom 10. The subjective probability function shall have the property that, 


for all numbers x, y, and z, where 0 S * 9 z < 1, there are events a, B, and y in 
E such that 


(i (a) = x, $(8) = y, and $(y) = z. 
(ii) & and B are both subjectively independent of ¥. 


pr axiom postulates a very dense set of independent events, so dense 
ery conceivable subjective probability is exhibited at least twice. 
7 ee way, we are making a continuum assumption about the indi- 
i Seed oo via the axioms. Although we have never made it 

ore, such an assumption was implicit in the work of Chapter 


A or there we tacitly supposed (as is reasonable) that we could deal with 
Y objective probability. 
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[Al.5 
These two subjective scales satisfy the ext hypothes ; 
n A and ain E, 


Axiom 11. 
the sense that, for 4 and 61 


u(aab) = o(a)u(a) + $(&)ule 


At this point there should be little reason to disc idea further 
except to note that we have not previously restrictet b to be pure 
alternatives. Although no restrictions are usuall ed when the 
expected-utility hypothesis is made, it is always tacit y assumed that it only 
holds for gambles whose component events are independent of the event o 


of the hypothesis. In utility theory, of course, independence is meant in 
the usual objective sense. For our purposes, it is sufficient to assume the 
hypothesis only for pure alternatives which are trivially independent of 
events. 


A1.5 CONCLUSIONS ABOUT THE SUBJECTIVE SCALES 


On the basis of these eleven axioms, the following conclusions can be 
established as to the form of the discrimination functions and the subjec- 
tive scales. First of all, Q must depend only upon the difference of the 
subjective probabilities of its two events. Put more formally, there exists 
real-valued function Q* of one real variable such that 


Qla, B) = Q*[d(a) — 4(8)]. 
This result is interesting because of its connection with a very old problem 
in psychology. A century ago Fechner introduced into psychology 
a concept of subjective sensation, which has since played a crucial and 
aie role in the development of psychophysics. Even today; 
is idea in somewhat generalized form continues to be debated and t be 


the i : 
definition "A experimental studies. The modern statement of his formal 

: on of a subjective scale of sensation is exactly the property ae 
above for Q. Th J 


€ source of co : ; not con 
cern us here. ntroversy in psychophysics need 


Act : 
tion eae cab give a much more explicit result than that ¢ is a sens 
Cases. ne ae describe the mathematical form of Q. There ar three 

rst, there is a Positive constant ¢ and Q is of the form 
1 
2 t Aloe) — 406)!" if a > B, 
oem if a ~ B, 

ea 46[¢(8) — ¢(a)]§, if B > @. 

nd is the discontinuous function 


1, ifa > B, 
Q(a, B) == ly, es mw B, 
0, if 8 > a, 


Q(a, B) = 


(See Fig. 12) The seco 


1.5] Conclusions abo 
which results from the first case by tak i ig the 
represents perfect likelihood discriminat 
obtained by taking the limit as € appro: 


almost total lack of discrimination. 


o(a) — o(B) 


Fic. 1. The function 
1g + lé[o(e) — o(6)), «> 8B 
Q{a, B) = {i a~B 
1g — Ww[o(8) — oa), Bre 

for e = 0.1, 0.2, and 0.5. 


It is easy to see that in the first case, but not in he, aches Fwos one, Cam 
express ¢ in terms of Q, namely, as 


(a) a [Q(a, é) Fr, Qe, a) hi 


or, more usefully, as 


14+ Wl2Q(a, a) — 11%, if Qa, a) > 2, 
$(a) = ) 4, if Q(a, &) = 14; 
4 — Wit — 2Q(a, a], if Qa, &) < 1. 


Similar results hold for u and P over the set A of pure alternatives. 
First, P can be shown to be a function only of u(a) — u(6), for a and 6 in A. 
Second, assuming a Q of the first type above and letting ¢ be the constant 
determined there, then 


ve ifa ~ 6, 


P( fe 4 U[P(a*, b*) — P(b*, a*)I[u(@ — u()),—ifa > 8, 
a,b) = 
V4 — W[P(a*, b*) — P(b*, a*)][u(b) — ula)’, if b > a, 
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and 
Pla’, b*) — P(b*, a*) | 
u(a) = Pe ..4) — P(a, a*) i ‘ ror 
NS ares ? 
1 — | par, b*) — P(b*, 2*) 
where a* and B® are mentioned in axiom 5. Any positive linear trans. 


formation of u is equally acceptable. . 

Thus, we have the following situation. If the axioms are accepted and 
if it is assumed that discrimination of events 1s neither perfect nor totally 
absent, then the mathematical form of the model is completely specified 
except for a single parameter €, which appears to reflect the individual's 
sensitivity of discrimination; and the two subjective scales can be inferred 
from the empirical estimates of the probabilities P. The subjective 
probability scale is unique, and the utility scale is unique except for its 
zero and unit. There is only one trouble with all of this: it is extremely 
doubtful that people satisfy all the axioms. 

An example and a theorem will formulate our doubts. Although the 
mathematical argument used to establish our results rests heavily on 
steps involving independent events, the final results can be shown to hold 
for events whether or not they are independent, so we need not worry 
about independence in a counterintuitive example. Consider the two 
chance events: rain on Wall Street at time ¢, and rain on both Wall 
Street and 34th Street at time ¢. Since the locations are not widely 
separated, both being in New York City, it is highly likely that if it rains 
Wall Street it will also rain on 34th Street, so the subjective probability 0 
i ee osale se will only be slightly larger than ad ate 
ee ES € is asked which is more likely, it seems silly ar ae 

so, we have ¢(a) and ¢(@) very close and Q(a, B) = 


: e 
Beppe actually behave in this way when making choices, then at ell 
ol our axioms must be false. 


A1.6 AN IMPOSSIBILITY THEOREM 
Casual i ‘ 
involving ota Suggests that there are many situations, 08 a 
e . e 1 
Fi S Of money, in which these conditions ca? ae 


rst, there are oy 
ey hcn te « at least thr , re pet 
discriminated with res ee prospects a, b, and ¢ which are P Pla A 


=1. This wi pect to preference, i.e., P(a, 6) = PU c) = ad 
ee $10, 5 = ¢s ae po ere, hen all other things are a . 
and 8, » and ¢ = $1. Second, there are at least w° aie 


which . 
are neither perfectly discriminated nor equally 


A a 121: a aD — 292 
1.6] An impossibility ineorem Jt 
- P, 
Al. 


ie. such that Q(a, B) # 0; 725 “a - ‘Sra imposslOulty theorem 
that these two assumptions are inconsistent with the cieven axiom: 
previously stated. re 

This result seems disturbing, for most of the assum] 

‘; based have, by now, acquired a considerable respectabilit yet, 
clearly, they cannot all be satisfied. The task of reappraising them is quit 
delicate, for there are numerous reasons for supposing that they are not 
terribly far from the truth. Some of these reasons have been given in 
Chapter 2. Another is that the derived form of the discrimination func- 
tion for events is sufficiently similar to much discrimination data to suggest 
that we are not completely afield. 

It would appear that six of our assumptions are subject to the greatest 
doubt. Of these, three (axiom 2, requiring that every pair of gambles be 
comparable by the induced preference relation; axiom 3, requiring that 
two gambles which decompose in the same way be indifferent in the 
induced sense; and axiom 4, requiring that the two discrimination proc- 
esses be statistically independent for pure alternatives) are subject to direct 
experimental study. The other three (axiom 9, requiring that axiom 4 
hold for certain gambles involving subjectively independent events; axiom 
10, requiring that certain triples of independent events be extremely 
dense; and axiom 11, requiring that the expected-utility hypothesis be true 
for pure alternatives) are impossible to study directly. Because of this, 
one can expect that most attempts to get out of the bind will be concen- 
trated on the second three. 

Since all the rest of decision theory is so dependent upon the expected- 
utility hypothesis, special attention will undoubtedly be given to axioms 9 
and10. There is the intriguing possibility that these subjective scales are 
discrete rather than continuous, as has generally been assumed, which 
would make them more in accord with the way people seem to classify, 
say, events: impossible, not very likely, etc. In that case, axiom 10 might 
be abandoned. On the other hand, axiom 9 when coupled with our 
definition of independence may be the source of difficulty. As the axiom 
seems reasonable for one’s intuitive idea of subjectively independent 
events, it may be the definition that should be altered. 

As it stands, two conceptual features of this theory are of interest. 
First, by making the assumption that the two discrimination processes are 
statistically independent, it has been possible to deal simultaneously with 

oth subjective value (utility) and subjective probability. Second, by 
ang axioms which are closely related to those of traditional utility theory 

the independence assumption (axiom 4), it has been possible to 
ao that both utility and subjective probability form sensation 
n the Fechnerian sense. In psychophysics it has been argued, 
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though never fully accepted, that subjective expe 
sented by such scales; however, the defining con 
nor has it been derived from other assumptions. 

has been to postulate this condition as an a priori 
sensation, and, of course, many have objected that ; 
cated to be accepted as a basic axiom. Whether 


this one and that arrives at sensation scales as a co! 


[Al.g 


USt be renee 
elther simp) 


e 
ional Practice 

: of subjective 
ne sophisti 
| that parallek 
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postulate, can be developed for psychophysical problems is not know 
: : MN, 
For a fuller statement of this theory and for proofs of the assertions 
d 5 y SCE 


Luce [1956 8]. 
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THE MINIMAX THEOREM 


A2.1 STATEMENT OF THE PROBLEM 


The general two-person zero-sum game with finite pure strategy sets 
can be characterized as follows: 


i. There are two players, 1 and 2. 


ii, 1 has a set A = {ay, a2, * + * , &m} of m pure strategies. 

iii. 2 has a set B = {81, B2, * * + , Bn} of n pure strategies. 

iv. Associated to each pair of strategies (a;, 8;) is a payoff of M (a, B;) 
Units from player 2 to1. M(aj, B;) is abbreviated by a;;. Hence the 
values to 1 and 2 of the strategy pair (a;, B;) are a;; and — 4;; units respec- 
t 


ively. Because these values sum to zero for every (a, B;) pair, the game 
18 called zero-sum, 


Vv. Player 1 may adopt a randomized (or mixed) strategy by employing a; 


With Probability x1, a2 with probability x2, +--+, a, with probability 
Xm, where 


3 


Such a strate 


ae 8y is symbolically represented by x = (x a, Xoo, °° + 
m&m), 


The strategy (0a, 0a, °-* , 1a, °° ° » 0m), which places all 
385 


max Theorem 


idered to be the same as the | oer. 
player 1 isd 5 
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i ., is coms 
the weight 00 % is 
set of all randomized strategies for 


indicates the number of pure strateg! Pp 
vi. The generic randomized strategy 10r < ' an 


. ++, 9nBn)s where 


yy =1 and Iz 2 


au 


es available 


y Bos 


The pure strategy B; 1s considered to be the same as the randon; 
strategy (081, 0B2, ha 1B;; ° Tai OB,)- The set of all enti 
strategies for 2 is designated by Yn. 11264 
vii. For each randomized strategy pair (x, y), the payoff M(x, y) to! 
defined to be rY) tolis 


M(x, y) = y ii j)j 
i=1j=1 
a > 1) 9) x:a1;) 
j=1 t=1 
; ss a Xi > aisyi)) 
the payoff to 2 is — M(x a az 
The symbol 7). 
oy) = > Oi5)3 
j=1 


° yoff to 1 Ww 
Quite analog hen 1 uses the pure strategy a; and 2 1° y 


ous] 
y, when 1 uses x and 2 uses B;, the payoff 1s: 


™m 


p%/8;) = 5X; 


Of 
Course, t=1 


viii. Symbolj ll es) = aij. 
= ca 
an BM a. denote the whole pure strategy 4° 
> the t uts i 1 incl i 
Wo pure strategy aoa ere the principa ingle 
s and the payoff function *"" she 


extension of 

. (4, B 

triplet teas y, oo to spaces of randomized denote 
» M). zed strategies is de? 


ix, 
layer 1's aim is t 
Os 
elect a randomized strategy x from Xm 


A2.1] Statement ot the Probiem 
maximize his return or, equivalently 

nature of the game), to minimize 2’s ret 
“a of the game depends upon the players 
given the number M(x, y) for each pair (x, y), 
M(x, y) by choosing x and, simultaneously, 2 atte ta a 
M(x, y) by choosing y. The rules of the game require that eaci 
choose his strategy (pure or randomized) in complete 1 
opponent’s selection. 

x. For each x belonging to X,,, player 1’s security level i 


v(x) 


min M(x, y). 
y 


Since 


M(x,.y) = 5 (y x0:;) = y yjM(x, B;) 
j=1 


j=l t=1 


is a weighted average of the n payoffs M(x, 8;),j = 1, 2,° °°, n, it is 
minimized when all of the weight is assigned to the least of these, i.e., 


v(x) = min [M (x, Bx), M(x, B2), mee th ig M(x, Bn)]. 


We may interpret v;(x) as the return to player 1 if he discloses to 2 that x 
is his choice and if 2 is allowed to choose his best response to x. 
If 1 wishes to maximize his security level, he must choose a strategy 
x such that 
v(x”) 2 v(x), for all x of Xm. 


Thus, if we let v;(x) = vj, then 
vy = v(x) = max v;(x) = max min M(x, y). 
x x y 
Note that v;(x‘) = v; implies M(x, y) 2 21, for all y; hence, x‘ 
guarantees to 1 a return of at least v1. A strategy x” which maximizes 
Vs security level is called a maximin strategy for player 1. Maximin 
Strategies always exist, but they need not be unique. We let 0, (0 standing for 
Optimal) designate the set of all maximin strategies.!_ Thus, if x* belongs 
to ©), then x* has a security level of v1. If x’ does not belong to 0, then 
x’ has a security level less than 2. 
| xi, Because the game is zero-sum, we may phrase 2’s aims as the 


Minimization of 1’s return rather than the maximization of his own. If 2 
uses y, 1 cannot obtain a return greater than 


ve(y) = max M(x, y). 


1 
The set ©; is a closed convex set. 
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i aximize his s¢ a ae 
In perfect analogy t© iadzying to ™ cL, 2 thing 
seit Let y‘° be such that 
minimize vey). Lely 
all ¥ 

v2 = vay‘) € v2(y); for all 
Then, ; % 

v2 = voy‘) = Sey ag M(x, Y)s 

nd 

: M(x, y\”) < ve, for all x. 
The strategy y‘” is called a minimax strategy for 2. We let O2 denote the 


set of all minimax strategies for 2. Thus, if y* belongs to 0», then 1 can 
surely be held down to at most 2 by using y*. If, however, a y’ is used 
which does not belong to O2, then it is possible for 1 to get more than », 

xii. Thus, if 1 uses a maximin strategy, he guarantees himself a return 
of at least v; units. If 2 uses a minimax strategy, he guarantees that 
4 cannot receive more than v2 units. Hence, it follows that v; < 2. 

xiii. A pair (x’, y’) is said to be in equilibrium if x’ is good against y' 
[ie., M(x, y’) < M(x’, y’), for all x] and if y’ is good against x’ (ie, 
M(x’, y’) < M(x’, y), forall y]. Theseconditions may be written simply 
as 

M(x, y’) < M(x’, y’) < M(x’, y) 


for all x and y, or equivalently as 


_ M(x, y’) = M(x’, y’) = min M(x’, y). 
y 
The following theorem is fundamental to a real understanding of 
main result in the two-person zero-sum theory. 


Theorem. Each of the following three conditions implies the other tw 


Condition 1. An equilibri . 
. equilibrium pair exi 
Condition 2. pert erst. 


7 = m * , 
a oe M(x, y) = min max M(x, y) = ”2 
(i.e., the order of the a x . 
operators max and mi , : recht 
1 min mak nce, oF m 
jargon they : x . és no differe » OT, 
» Mey are commutative). 
Condition 3, 


There exi (0) jn Js 
such that exists a real number v, an x in Xn, and 2Y 


(a) a0) S ' 
2, asl 7 DV, Pd 2.0 +, n, 
0) Yay 

f a7; Sv, fori = 1, 2, 


Ey it 


= 


Statement of the Problem 
A2.1] 


. (0) : “an oNnATA 
(That is, by adopting x° © player 1 can guar 


S 


. 0) , 9 ah op tha 
by adopting y player 2 can guarantee tha 
Proof. 7 implies 2: Let (x’, y’) be an equils 


vo = min max M(x, y) < max M(x, y’ 
i. ¥ x (2) «x 
= min M(x’, y) < max min M(x, y) = 71 
a) ¥ (5) x y 


These equalities and inequalities are justified as follows: 


(1) Definition of v2, number xi. 
(2) Definition of minimum. 
(3) and (4) Definition of equilibrium pair, number xiii. 
(5) Definition of maximum. 
(6) Definition of v1, number x. 
But, from number xii, v1 < v2, so v1 = 22. 
2 implies 3: Let v = v1 = 22; let x‘ be maximin, and let y’” be mini- 
max. We then have for all j and z 


i 


Yaga’ = M(x, 8;) > min M(x, y) = max min M(x, y) =2 
(1) Qinvy (3) «x y (4) 


= min max M(x, y) = max M(x, y°) 2 M(a, y) = y a;;y5”. 
i) y x (6) «x (7) (8) ‘F 


] 
These inequalities are justified as follows: 


(1) Definition of M(x, y), number vii. 

(2) Definition of minimum. 

(3) Choice of x, 

(4) and (5) Definition of v and condition 2. 
(6) Choice of y 

(7) Definition of maximum. 

(8) Definition of M(x, y), number vii. 


3 implies 1: From a and b of condition 3 it follows that 


M(x, y) 2 v 2 M(x, yy); 


M(x V0 But putting x = x and y = y\ we see v = 
xs ¥); hence (x, y‘) is an equilibrium pair by definition xiii. 
tks. (a) From the proof that 1 implies 2, we see that, if (x’, y’) is 


an €quilibrium 4 ? , : ee dean E 
pair, then x’ and y’ are maximin and minimax respectively. 
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(b) From the proof that 2 im 
» of condition 3. vi 

(c) We still do not know whether ! — er aLeRY game 
(A, B, M), an equilibrium pair exists, or if v1 = v2, , t there exists, 
triplet (v, x”, y”) satisfying ¢ and Fas oo ‘ The Princip 
theorem, generally known in the literature as the minim ix theorem, estah 
lishes this existence; it was first proved by von Neumann in his 1928 Paper, 


1 aNd dp it, 


hether for an arbitrary finit 


A2.2 HISTORICAL REMARKS 


The several proofs of the minimax theorem which exist fall into two gen. 
eral categories: those which rest on fixed-point theorems or iterative Proc. 
esses and those which depend upon separation properties of convex sets, 
In giving some geometrical insight into the principal theorem (Appendices} 
and 4), in describing the linear-programing problem and its relation to 
two-person zero-sum games (Appendix 5), and in surveying the methods 
for solving such games (Appendix 6) we almost, but not quite, prove the 
theorem in several different ways. As none of these incomplete proof 
are of the fixed-point variety, a complete and elegant proof due to Nash 
[1950 a], based on Brouwer’s fixed-point theorem, will be included in the 
next section of this appendix. But first some historial remarks, which ate 
little more than a partial summary of Kuhn’s [1952, pp. 71-84] excellent 
survey of this literature. 

The first proof of the minimax theorem was given by von Neumanq 
[1928]; it, too, made use of Brouwer’s theorem, but is quite involved. 
Motivated by von Neumann’s 1928 proof, Kakutani [1941] presented 2 
oon eralization of Brouwer’s theorem which is tailor-made to prove ie 
minimax theorem—so much so that it becomes almost a trivial ord 
‘ m. We have chosen to use Nash’s proof rath 
{ns because it depends only upon the intuitively oa 
ae wer theorem. In addition, Nash’s proof is related ' * 

i : technique discussed in Appendix 6, 

pe Slerae though still Partially topological, proof was given 
the statement of the minimax theorem 18 oe 
Id be possible to give an entirely algebraic ast 


ou 
he shortest self-contained proof, is due to Lee. 


ntary, 


other algebraic 
Tucker [1950q]. 


> ae 
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A2.3 NASH’S PROOF OF THE MINIMAX THEOREM 


In broad outline, Nash proves the theorem in this way: He define 

transformation TJ which maps mixed strategy pairs (x, y) into mixed 
: ek , , Arh iia The Hee araneutiee 

strategy pairs T(x, y) a (x 9 y ie W here i has the two ime UP i a 

jx and y are optimal strategies if and only if 7(x‘”, y‘”) = 
(x, y), ie., if and only if (x6), y‘”) is a fixed point under the 
transformation. 

ii. T has at least one fixed point. 


The transformation is defined in this fashion. Let 


p< M(a:, y) — M(x, y), if this quantity is positive, 
eee? 0, otherwise; 
ies 7) = M(x, y) — M(x, 8;), if this quantity is positive, 
0, otherwise. 


Il 


Using the notation T(x, y) = (x’, y’), we define 


"> xi + ¢;(x, y) 


1+) cx, y) 
k=1 


xy 


and 


, _ Ji + d(x, y) 


1+ ) d(x, y) 
k=1 


Vi 


It is straightforward to verify that 


xi 2 0, o xy = 1, yj 2 9, and y yi = 1. 
i=l 


j=1 


We first show that (x, y) is a pair of optimal strategies if and only ifitisa 
fixed Point of this T. Observe that c;(x, y) measures the amount that a, 
Ctter than x, if at all, asa response against y, and that d(x, y) measures 
r € amount that 8; is better than y as a response against x. Now suppose 
aor x and y are optimal. Since x is good against y, it follows that 
i\X, y) = 0 for all i, so x,’ = x,, for all 7. Similarly, y,’ = y,. Thus, if 

may) isa Pair of optimal strategies, T(x, y) = (x, y). 
© show the converse, suppose (x, y) is a fixed point. We first show 


t 
hat there must be at least one i such that both x; > 0 and ei(x, y) = 0. 


[A2, 
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Since, by definition, 


m 


M(x, y) = ), *M(as y), 


t=1 


t ho l] 1 such hat 
Dp ee eee cle) — Hou») ~My 
+ ys Thus, for at least one 2, oe A Be | 
| . i Lili 
But for this 7, the fact that (x, y) is a fixed p 


xy 


d 
(hd 


i-=- cK (x, y) 
2, 


so S cx(x, y) = 0. But the terms c;(x, y) are all non-negative, so they 

hash all equal 0. ‘Thus, x is at aa as .. — ae y “ 
; ‘ e 3 

Pri G. c. sane LA Sagal a: the proof of the first 

a of a fixed point for T follows from the Brouwer - 

point theorem. We shall state a particular version of that theorem 


i he version 
indicate how it can be used to prove that 7 hasa fixed point. T 
is this: 


nteri ted in 0 
If a function maps each point of a sphere S (interior plus boundary) e 5 
Euclidean space of finite dimension into another (not necessarily distinct) p 


. . 5 . is J fe 5 mappe 
and if the function is continuous, then there exists at least one point which 1 
into itself. 


: eh ; here 
In our Case, the set of mixed Strategy pairs is certainly not a s on 
but when we don the topologist’s glasses it can be made to look li 


: en out 
More specifically, we can find a one-to-one correspondence betwé 

set of strategy pairs and the 

ways in the sense 


into Points “close”? 


hich # 
together in the other. The mapping Se 
clearly continuous, 


¢ 
when iterated with this one-to-one etal con 
Phere into itself that is easily shown 10°" | 


q fix 
i m, the induced mapping has 
Point; hence, 80 does T. 


w icZ an 

th Ww: 
Prove the Brou €r theorem here (for a proof see Hurew se 
but it can be 


z e 2 re 

made extremely plausible in 2-spa° i 

: the sphere, i.e., circle in th I d F the mapp!®> er! 

i takes z into Fi (z) (see Fig. 1), If F had no ecient ic, if the imag iowits 
Point were to be distinct from the point itself, then we could perform the 


—- 


A2.3] 


trick. 
passing t 
Since F(z) 


q function W 


Nash’s Proof of the Minima: 


For each z, let G(z) denote the point where th 
hrough z intersects the boundary of ae 
~ zand since F is continuous, it follows 
hich maps the whole sphere onto its 


Fic. 2 


points fixed, necessitates “ripping” the interior of the sphere, i.e., there must be 
points “close” together in the interior of S which under the mapping are shoved 
“far apart.” Thus, the function is not continuous, contrary to what we have 
shown for G. The assumption that got us into this contradiction was that F 
had no fixed point, so we must conclude F (z) = z for some z. | 


appendix 3 


First 
GEOMETRICAL INTERPRETATION 


OF A TWO-PERSON 
ZERO-SUM GAME 


‘ the 
. : : ° on of 
This appendix presents a complete geometric interpretatl 
minimax theorem when 


elements, namely 4 = 
Consider the game 


ists of t¥? 
Player 1’s pure strategy space consist 
{a1, a} . 


Player 2 
Bi Be 
Player 1 “1 | 411 5s 
2 [421 aoe}. 
> 0, 4 
Any randomized strategy (x10, xa), where x eg 1, 417 * of 
x2 2 0, can be id 


entified with 
length 1, as in F 


ent 

: ; egm 9 
a point (x1, x2) on the line s°8 ‘ 
chooses 8, 


| d play 
1g. 1. If player 1 chooses (xa, x2@2) 4° 
the return to player 1 is 


M[(x1a, X2Q9), By] —— CARE at +t a21X2- 
394 
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A.3] 
Geometrically, we can TN sehen: of Alivia xsi) 6 
oint (x1, x2) as in Fig. 2. 


If, however, player 2 chooses Bo, then 


M{(x1a1, XoQ@9), Bo| = Ajox 9X9 
and a different, but similar diagram results. Superimposing these two lines 
we get a drawing of the type shown in Fig. 3. The particular case drawn 
——__—____—_—— l ———»> 
| 
| 
——— x9 Sane x} % 
| 
(1,0), (x1, X9) (0, 1) 
Fic. 1. The point labeled (x1, x2) is x; units from (0, 1) and x2 units from (1, 0), and 
x] + oo : i 
a91 
B, line 
ai 
(1, 0) (x1, X9) (0, 1) 
) 
; (1, 0) x (0, 1) 
A Fic. 3 
yf Suppo 
se 
2 i. 8 that a2) > ao0, a@19 > aii, and a2 > a@o9. If player 1 chooses the 
ee = (x1, x2), which lies in the interval marked X$”, then player 2’s 
line sponse is 8}. The vertical distance from the point (x, x2) to the By 


re : 
Presents 1’s security level corresponding to (xa1, x2@2). Similarly, 


i -Person Zero ties 
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i best respo Sain tl 
) is in X$”, then Bo is the E.. tthe ven 
if (x1, *2 heavy line r¢ Security | 
ee from (x1, ee te 1’ cic hus on . 
: heavy line represents 1’s securit: as 
Hence, the heavy ‘min strategy, and the v Same js, 
x60) is 1’s unique maximin f 
2 


secu by : fe 
If player 2 were to use Ai; then 1 could s Y empl; 


1a); and if he were to use Bo, then 1 could s »(>2) by emp | 
cut _ Ow ). Hence, to hold player 1 dow n layer 2 musty 
ing (101, Oa). Bo). The payoff is tron: all 
randomized strategy (7181, y262 ) hi 


domized strategies is: 


M[(x101, x22), (9181; y282) | 


= 441*1)1 = a12Xx 1) 2 si aQ1X2V1 +r A22x V9 


= yi(a11%1 + a21x2) + y2(a12%x1 fe 420%9) 
= yiM[(x101, x2a2), Bi] + y2M[(x101, x20), fy 


So we see that (181, y282) yields a line which can be pictured on our dia 
gram as a weighted average of the lines corresponding to 8 and fy. _ 
yi + y2 = 1, the line associated with y = (181, y282) must always li 
between the (; line and the 82 line, and so it must go through the poi 
(x1, xi), v). Indeed, as y; goes from 1 to 0, a family of lines is generates 
which, so to speak, pivot clockwise about the point [(x‘, x{), »] rom _ 
line 8; to B:. For each particular line y chosen by 2, player ! ca 
strategy choice which will maximize his return. In all cases, save a 
the line is horizontal, the choice is either (1a, Oa) or (0ai, 122); ait 
return exceeds ». To be certain that he will hold player 1 down (04° 


: zontal 
must therefore choose the horizontal line in the family. For the horiz0 
line we have: 


SMe, X20), (98, y6B5)) = v, for all (x1, * 


By setting x1 = 1 and then x». 


0, we obtain the equality 


(0) 0 
Ii ayy + yf lays = yao, + yao (=v). 
Since y() 4 0) 


from pl as 1, we can solve for (y{, »§) and for v. 
ct Player 1s analysis, then we can simplify our computatio 


yD 


ly 


If vis ko 
a gligh”? 


nan t= y)are 10: 


re 8 
der assume 


ctu 
that all 2 by 2 games have the same s™¥ onto 
{' 
er 
8 which can occur. Inc, d, and 4, all of Peni u 
In }, ¢, g, and i, player ! x 000 
f the (heavy) minimum fun¢ 


Lest the rea 
the diff 


‘ First Interpretation of a Tw 
A] 
da 


- each case at the boundary—.e., a 
“iaver 1 has an interval of optimal stra 
sserval of strategies. 

In a, player 2 has a unique optim 
was considered in detail above. In/ 
all lines associated with the family (yf 


By bp 
Bo é, 
ot see EE : 
2 
| u : : 
ee NPR eS pee be oe Wa 
(g) (h) (i) 
Fic. 4 


zontal, but only the lowest line is optimal. In d everything is optimal 
Player 2. In ¢ and f, Bi is optimal; in g, B2; in h, 81; and in i, even 
rm of player 2’s strategies are minimax, {2 is 2’s best strategy in the 
strategy ee only is minimax but it is the best response against any 
eid or not one constructs the diagram associated with a game (it 
be done in two dimensions for games where both players have 


t Interpretation of a Two-Person Zero-Su 


398 Firs 
"e 
three or more pure strategies), one 1s interested in tl a (Aj 
*SCCTIONs ; 
lines (or planes or hyperplanes when there are mo * DS Of th 
other and with the vertical boundaries. These px te With eacy 
. TSECtion C- 
vd] 


braically. The above examp 10w that th 
tlldat e lines , 


always be found alge 
ct; may intersect at points 


planes may not interse cre 
(cf. e and h); may intersect at a point representing a si Be be negative 
is not optimal (cf. g); may intersect at the unique optimal ssid he Which, 
may intersect at a non-unique optimal point (cf. /) Cl. a); on 
An extension of this analysis to games where player 2 has more th 

strategies is extremely simple. Consider, for example, the cas se two 
A = {aj, a2} and B= ae Bs}. To hf, j ve im 

j » 2, 3,45 


Fic. 5 


If 2 wishes to hold 1 down toa! most 


involving only B2 and B3- or, } 
Bi oe Br then by using (1 2) Pe 
A a urse, if player 1 fails to play optima ) 
Ty 9 a2)], then it may benefit 2 to use 81) = 
than 5, or 84, since there are mixtures ° 
rable to 84. | 
3h? 


player 1° ; 7 
‘ 1’s randomized strates} € may point out that, if A = {a1, 4 ? 
gles can be identified with the points of i 

Jay 


1’s oT auts » and co 
Possibilities can be pj nversely. If player 2 chooses §j; then “st 
‘ s 


and j € pict 3 : 
if player 2, chooses eulnis asin F ig. 6. Player 1’s payoff if he be 
j 1S the vertical distance from the point of t 
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A. 

horizontal equilateral triangle to the f; plane. B superimp< 
oo B, planes (of course, the diagram is terrifically mess} 

Ei we can examine the minimum function or security level function, v hi 

ee 4 surface whose values depend upon the generic point of the equ Ee 


lateral triangle (i.e., of the generic strategy x of 


plane ——~ 


>8 


(1, 0, 0) (0, 0, 1) 


Fic. 6. There is a one-to-one correspondence between randomized strategies x = 
(x10, 209, x303) and the points of the equilateral base triangle having an altitude of 
unit length. Note that x1 + x2 + x3 = 1 for every point of this triangle. For pur- 
Poses of clarity, the front face (xz = 0) of the game cylinder has been removed. 


Chooses x‘ to maximize this security level. Player 2 uses a random 
Strategy corresponding to the plane(s) which is a linear combination of the 
8; planes and which is never more than v units (the value of the game) from 
the horizontal. 

On the basis of such geometry one can develop a formal inductive proof 
of the minimax theorem (cf. Appendix 1 of Kuhn [1952]). We shall 
return to this geometrical interpretation again in Appendix 6. 


appendix 4 


SECOND 
GEOMETRICAL INTERPRETATION 
OF A TWO-PERSON 
ZERO-SUM GAME 


The following geom 


‘ ictorially 
etrical interpretation can be presented p 
only if m, the numbe 


IN shall 

r of pure strategies for player 1, is 2 or 3. Lee 

illustrate it for m = 2, let the reader visualize it for m fa sae to any 

without proof that the 8cometry of these special cases carries ove! h the 

finite m with Only minor terminological modifications. See 

Concepts cannot be represented Pictorially for m > 3, it is ~~ logy 3 
extremely advantageous to employ the same geometrical termino’'»: 

developed from m — 2 and 3. + Bal 

Let (A, B, M) bea game Where 4 = temas}, B = {81,8» |", 

and let (Xp, Y,, M) 


; For 4% 
; be its randomized strategy extension. t the par 
randomized Strategy y = (9184, J2B2,* + + , ynBn), we can plo 
of values M (a1, y), M (a, y), where 


M (cx, y) = y 215 Vj, 
=1 


j 
400 
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M (a2, y) = ) 


asa pointin the plane. ‘T’hus, the point asso 

tation that its ith coordinate is the return to pl. 

terminology extends to any m: if m = 3, the point r ( 

point in 3-space; if m > 3, it is a point in m-space). If player 2 
then 1’s best response is to choose the strategy corresponding 
largest coordinate of the point associated with y. 

| Let [mi(y), mo(y)] be an abbreviation for [M(a1, y), M(a2, y)] and let 

91 be the set of all such points of the plane generated as y takes on values 

in Y,. In symbolic notation,! 


ocy 
ass 


MM = {[mi(y), me(y)] | y belongs to Y,}. 


Note that to each y belonging to Y,, there is associated a point in IN, and 
that to each point in 9M there is associated one or more elements of Yn. 
If [mi(y’), mo(y’)] = [mily’’), me(y’’)], then y’ and y” should be con- 
sidered strategically equivalent since they present identical opportunities 
/ to player 1. 

| We can therefore view the strategic role of player 2 as choosing an element 
from the set MM. If player 2 chooses the point (m1, m2) of M and player 1 
chooses (x10, x9a), player 1 receives x1m 1 + xgm2. This is a weighted 
average of the coordinates of the point of SW selected by player 2—the 
weights being selected, of course, by player 1. 

The geometrical nature of M is particularly simple, namely, a bounded, 

) closed, convex polygon (i.e., it can be enclosed in a circle of finite radius, 
the boundary of 31 belongs to IN, if two points belong to SM so does the line 
segment joining them, and the boundary is composed of linear segments). 
Tn higher dimensional space (m > 2), the polygon becomes a polyhedron 
and the boundary is composed of (hyper) planes. 


1 : : 
In other words, Sv is the set of points (mi, m2) where 


n 


m, = M(a, y) = > 4139; 
j= 


3 


\W 
(=) 


y is in Yq, ice., » vy =1 and vj 


5 


t 


Second Interpreta 
| 


ers a little more concrete, let us conside, lt 
A er the fol 
OW, 


i T -P 2 ero 
402 tion of a Two-Person Zero-Sup, isk 
To make matt 


game: Player - 
Bi Bo Bs Bs B; 

a, |1 mo 35 4 

Player 1 

a2 SE | 


The set SI is constructed by plotting the points in the plane (7 
associated with the columns of the game matrix and then te 
smallest convex set containing all these points, as in Fig. 1. Fo at: th 
the point m in Fig. 1 represents one of player 2’s randomized ora 
which places positive weights only on (i and Bo. Tater 


ng 


nes wi? 


Tn the followin 
ty "i 


the associ 
ated region 
: s sh A ‘ hit) 
assuming th own in Fig. 2, There is no loss of 8° 


“8 Mat MW is in - 
cimensional the positive quadrant (orthant, if we 2" - r: 
a game d 
coordi 5° = ihe strategic considerations involved: pelo 
reinate my ig lar 1s dotted and labeled /. Note a go 
ample : than ms, and that, above /, m2 is larget | a! 
m, ms) is, '$ @ good response to player 2’s choice of 
im to s i Point of IW on or above / go" 
€ ; . ae 
Ost * @ point (m1, m2) of I such that 1 nom 
af ss of any other point of IM; in ot é vt 
m - + max § : 
WANES to hold 4 go) Corresponding to a minim? 430! 
Own to as little as possible, 


Second Interpretation of a Two-Person Zero-Sum Game 


A.4] 
consider various values, such as v*, and to ask whether 2 can hold 1 down 
to v* or not. It is easily seen that this is possible i] 


contains a point both of whose coordinates do 1 


Fic. 2 
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occurs if and only if the region labeled 9(v*) in Fig. 3 contains at least 
One point of IW. Formally, 


M(v*) = { (mi, mg) | mi < 0*, me Guy *} 


so *) : A eee ra age 
WU(v*) is the negative orthant with its origin displaced to @A;. o*)x 


Interpretation of 


lue of the game) 

“ots a value Y (the va : 

ge Pon to v but not below v. Hence a 
ee uary poi t(s). See Fig. 4 for these poi 


les. | 
tine quantity y* is small enough, then (in : 


will not be able to hold player 1 down to v*. | 
‘oint from IW. As y* increases, the displaced 
os translated in a northeasterly direction, s 


quantity 7 such that 


404 Second 


Fic. 4 
Any point that 


eds 

Bead etn ‘Wo convex sets su 

Pd dimension) sepa 
ra 


v* that Strategies ree, 
a” the sets SU(v* 
a “segment joi 

f values Dnt 

: Sy limitin 


a Two-Person Ze 


IS, U0*) wil 
peak, and there j 


9u(v*) just touches IW when v* equals », 


Player 2 chooses which is common to Mt and r t 
© at most v. Hence any such point corresp “il 
8y for Player 2. Observe that, in the games corres 
the minimax strategy for player 2 is unid ; 
le segment of minimax strategies for play® ¥ jst 
ch as IN and 9(v), there isa line (BYP yoni 
ting them.® That is, a line & 
equivalent payoffs are considered identic 
) and MK are disjoint, and so the perPe” tis? 
ning JU(v*) and IM separates those bod! per 
approaching v in such a manner 


: x & line which can be shown to S¢ : 
Rithis appendix carries over t© the 


m Game 


Playey 
2 can 


“iar game) Player 
Ibe 


/! 


(hl 


4 
ue, 


sts 


i: ' ct 
ar ee 
je? 
i¢ 
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that the ne 
arate * 
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poth St and 91(v) such that SI lies on one side of tl in 
other side. (As can be seen in 8, this line 
show how it relates to player 1’s solution o! 
Let L be a line separating IW and 9i(v). Su 
as the set of points (71, mg) which satisfies a1 


0) } 
xm, + x3” me = k, 


where (x(””, x”) determines the slope of the line and k fixes which particu- 
lar line is chosen from the family of lines whose slopes are dictated by 
0 (0) 
ei Xo ). 
It is easily seen and is easily proved that: 


i. Neither x{) nor x§°) can be negative [for otherwise we could find an 
(m;, m2) on the line which is interior to 9U(v), i.e., not on the boundary of 


K(2)]. 
ii. The point (2, v) is on the line L (check this in c and d above). 
From (i), we lose no generality in assuming x{?) + x§ = 1 (for if 
x60 + x{ = 1 we could consider the line 
x) nh xi k 
gy ey 8. =) oY 
xO $x xO $f x0) xO) 4 x{0) 
and relabeling we would get 
xy'my + xo'm2 = KF, 
where x)’ + x’ = 1). Since (i) enables us to take x + x{? = 1 and 
since (v, v) lies on L, we may conclude that 
xy + v=k= (x6 + x0)v = x, 
sok =v. Hence 
L= {(m, m2) | x{ m1 = ig xs°me a v}, 


where x 4 x6 = 1, x > 0, and xs” 2 0. But then all points to the 
right of L, or above L, must satisfy 


xm, + x me > v. 
Thus, for any (mj, m2) of IW, we have 
xm, + x me 2 v. 


a ss aati aaa ae ee ee 
number m of pure strategies of player 1 is greater than 2. Instead of the geometry 
8 embedded in 2-space, it is then embedded in m-space. The separation of the 

© convex bodies SIL and Mv) by a hyperplane is by far the deepest mathematical 


ai needed in the proof of the minimax theorem which results when the outline we 
ve given here is made rigorous. 


n of a Two-Person Zer< 


d Interpretatio 1 Game 


on 

406 See (0), | ld, 

if player 1 chooses the strategy (x; e Ne Can Ret as 

Hence, #! Pp? ma) of 3. Thus such oy ene 

v for all points ae Xin | 

if : 

player ;, and d the separating hyperplane and $0 play. 
i is unique. In b, there 1s n QUE Strate, 

maximin strategy Hneline. Inc, the li rtidad BY Sing, 

there is not a unique separating , ena and hence 

form g 
a 1m, + Ome = 2, 
Le., 1 is maximin for player 1. [Indeed, « is best for all (m, ms) ¢ 


91, since all of I, is below J.) In d, the separating line is horizont 


namely: 


Om, + 1me = v. 


In other words, a2 is maximin for 1; however, it is not best for all (m,, n, 
of Mg since Wg intersects /. 

"This completes the story as far as optimum’ strategies are concerned, 
In addition, using these same geometric considerations, it is easy to see the 
possible effects when player 2 chooses a non-minimax strategy. To thi 
end, let us suppose that player 1 chooses (x,@1, x2a@2). If player 2 choos 
the element (m1, m2) of SM which lies on the line 


xm, + xome = fk, 


then player 1 receives an amount &. Consider the family of lines 


xm, + xom2 = k, 


w « 
k takes on different values, as shown in Fig. 5. Since playé! 
Be i *202), any of the points on the same line of the family yi" 


the iS Q 
oat, a Hence player 2’s response to (x1a1, *2%2) mere : 
in€ in the family: b : railable 
ut are ava 
player 2 nly tho y; but, of course, not all lines esos 


W(rie:, xs0,) = xj which contain points of 9. Thus, his bet". 
>] 2 => x . 4 ; r and }s @ 
far left (or down) is the line which both contains elements of git a 


a hea’? 
line represents sb Possible in the family. For the case draw | 
est choj | 
layer 1? + "A 
st ee xpected return when he uses x and when 2 choos nf 
oe ty the security level of strategy x, which a 
quation for the heavy line is 


1 xm + xymy = v(x). 

Not Oughout, “opti dit sot 
m » ’ 

Sta allowed to aie means either “maximin” or “minimax, “ ” gue 
Nts are th, € the flavor of “this is elit Adve player shoul 


ie meta a 
Same theoretic, not game theoretic. 


— 


4 

Second Interpretation of a Two-Person Zero-Sum Gam¢ 4 

A.A] a eos bi 
This line must intersect the 45° line at the point [vi (x), 71(X)]. ¥ 
i : . 


~ wage wa < 


resent the security level of a strategy x, draw the family « 
ee ’ bd » | 2 Lk: c. Ay i : a 
ae with x, and choose the one which is a left-sided suppo! 

ci ait | 


mon value of the coordinates of its intersec 


eer 
SS 


é 
{ 


The com ee 
], is the security level of x. Player 1’s security level 
? 


Line of left ea 


mo 


for x: Line: xm, +x2m, = 
xym,+ XQMo= v(x) ak aN 4 \ \ \ 
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strategy x‘ such that its line of left support intersects / as far to the right as 
Possible. Clearly, for the case shown in Fig. 5, the maximum security 
level is given by the line (x(a, x$ a2). The reader will find it profitable 
to check through the above discussion for 3%, 9W,, and Iq, shown in Fig. 
4. Remember that, since x; and x2 cannot be negative, the slopes of the 
lines cannot be positive. 

The geometry of the minimax theorem described in this appendix was 
formulated by Gale, Kuhn, and Tucker in 1948. Later it was utilized 
by Gale [1951] and Karlin [1950]. 


appendix 5. 


LINEAR PROGRAMING AND 
TWO-PERSON ZERO-SUM GAME 


: rst we will dau 

This appendix is divided into three sections. eee tach ac 

te that a two-person zero-sum game can ar ee 

line : rograming problem. In the second we a ory. The mater 

sen aming problem and discuss its ... athe third e 
in section 1 serves as motivation for the duality t how the general ls 

tion we will employ the duality theory to show a two-person Ze! 

Programing problem can also be interpreted as 


sna 
we S 
p - however, hieve 
. aj; nl 
game. The Principal reference is Dantzig [1951 ah attempt to 2° 
depart from Dantzig’s treatment at several points in 
Maximum clarity. We sh 


1951]. 
all also use Gale, Kuhn, Tucker 


A5.1 REDUCTION 


OF A GAME TO A LINEAR- 
PROGRAMIN 


G PROBLEM 8 M) 

ame f(a 

Let us assume We have a specific two-person Baa aii los 

where 4 = fei. reo = {81,°-- ,Bn},an ntail a) 
B;) is POSitive for alli and 


e 
J- The last requirement does a 
he same Positive quantity 
ategic structure of the game. 
408 


A5.A] Reduction of a Game to a Linea 


Player 1 can guarantee himself at least v’ 


SS ae Xm), Where x; 2 0 and 


M(x, B;) 2 2%, for; = 1 


which is equivalent to 
™ 
ate EAS Laney tg (2) 
- Ajyjxi 7 Us, for j = 1, 2, _n. 2) 
i=l 


By dividing eq. 2 by v* and writing x,;/v* = uj, it is seen that player 1 can 
get at least v* if there isa u = (ui, U2, °° * » Um), Where u; 2 0,for? = {; 


2,: °°, m, and > u; = 1/v*, such that 
i 


> a;ju; 2 1, for jose ied, "5M (3) 


Equation 3 is equivalent to eq. 2 since multiplying by v* and writing 
uy * 


v* = x; yields eq. 2. Consequently, we can view the problem con- 
fronting player 1 as follows: 


Player 1’s problem. Let Ube the set ofallm-tuplesu = (uy, Ue, ae 
Um) such that 
u; > 0, foe es a, 
and 


Y aigus 2 1, oer SS 2y 


To find those u belonging to U such that ) wu; is a minimum. 
i=1 
Remarks. (1a) If u = (uy, * + * , Um) belongs to U, we have seen that 


‘ 1 
Player 1 can guarantee himself at least a In order to secure the maxi- 
uj 


ine guarantee, player 1 should attempt to find a u in U which maximizes 


) ; or, equivalently, minimizes us Uj. 
i 


- a 
i 


(1b) The problem of minimizing a linear form such as y u; (or, more 


+ 
ene ° eas = < 5 5 
Senerally, Y ¢iu;) subject to restrictions involving linear inequalities such as 


1 


i 
i 
¢ 
i 
: 
7 
i 


> 
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Equivalently, it is easily seen that player 1 can get at most o* if thereexs 


aw — (ei, @s, ~~ - , w,), where w; 2 0, for ; = 1, 2, 
\a=1 2*, such that 


} azn; <1, ms = 1, 2, 


Consequently, we can view the problem confronting player 2 8°" 

; =- - . tS ae 

Player2’sproblem. Let Ii”be theset of all n-tuplesw = (#1,"2 
2.) such that 


and ws; 2 0, for } = 1, > a rn. 


i 
To find those w belonging to WV 


such that y w;isa maximum. 
ind ‘ 
j=l Bea have = 

that pla (wy, Wea, a . 2») belongs tn Ww, we b 

yer 2 ca 
hold Player 1 down to at most ———" 
Player 1 doy a" 
7 ol 
48 much as : ; God 3 


W which ae 1 Possible, player 2 should attemp* to 


- « - loan 
>, oa * OF equivalently, maximizes > ij 


In order ©" 


— 


uction of a Game to a Linear-Programing Problem 
A5.1] Red f 


(2b) The problem of maximizing a linear forn 


generally, ) b,w;) subject to restrictions involvis 
4) 


= ( a . ~~ ca } , q] 9 ~ 
as aj j0j € 1, for: = 1, 2, - + - , m (or, more genera f Uij"5 
; S ¢ : a) ae fee rales 
oa 2, ~~ , m), where w; 7 0, for; = 1, 2, - 


d is alictu a 
hiy by 


linear-programing problem (of the maximizing variety). 

b) ee = je d = 

(3) Player 1’s problem and Player 2’s problem are said to be qual 
linear-programing problems.’ 


Any w in W guarantees 1 
uj 


1 
Any u in U guarantees player 1 at least ) 


¢ 


at most ee Since player 1 can get at least : and at most =——» we must 
Dt dw 2, i 
j i j 
have 
Sia nail 
due Da 
¢ j 
ise, 


But we know that all zero-sum games have a value which can be inter- 
preted as follows: Player 1 can get at least v (i.e., there is au‘ in U such 
that 


1 


a =), 
y uy 


t 


and player 2 can hold player 1 down to at most v (i.e., there isa w™ in W 
such that 


is 
Spans s 
0 
dos 
S j 
ummarizing, we have the symmetric problem. 


“In remarks 1b and 26 following the two problems, we indicated how the linear- 


(hae problems are generalized by introducing numbers (51, 
rh a 


cms) bn) and 
eee Cm) as data of the problems. This was done in such a manner that the 
Problems given in 16 and 2b are also said to be dual. 


- and Two-Person Ze ” 
4ig «Linear Programing = 
: 2 Tofindu’’ in‘ ae 
Symmetric problem. 1ch tha 
n m 
pur- 5. 
4 =l ia 
Remarks. (4) If a, w solve the syrmm« ot tie 
1 l 
v= Sa 
0 ( 
Tap Se 
j a 
( iA) et 1 
and x‘ = ie”, “a > where x5" ae i yori = 1,2, “7am 
imi = “ <CaO (0 Eee (\)) ee 
is maximin for player 1; andy = (yi » , 9), where yi = ow!) 
forg=1,2,°°°> ™ is minimax for player 2. Conversely, if (x 
y‘”, v) constitutes a solution of the game, defining uf) = xi /a, wi = 
y/o yields a solution of the symmetric problem. Furthermore, u” isa 
solution of 1’s problem, and w°) is a solution of 2’s problem. 


A5.2 DUALITY THEORY OF THE GENERAL LINEAR- 
PROGRAMING PROBLEM 


The data of the general linear-programing problem are an n-tuple b = 
(bi, bo, Oe 8D 5 be), an m-tuple c= (¢, 02; mie © -.), and an m by h 
array of numbers a;;, where i = 1, 2, - - - , m andj = 12,0070! 


The , ‘ 
omg?! be any numbers—in particular, they are not assumed to be 
non-negative. 


> 


u Ae minimization problem. Let Ube the set of all m-tuples U = (hs 
St ae) a) such that 


(i) ui 2 0, for i = 1, 2 ms 


(ii) uja\; “4m. 
iV , Rar: 
Find a a Und; 2 6; forj =1,2,°°°°™ 
Pe thos i F rm) 
€u belonging to U such that the index (which is 4 jineat fo" 
is a minimum Oily Coo + + - + Cintlm 
The Aes , 
Maximizati z (wh 
on we 
Wa-+- problem. _tuples 
; » Wn) such that Let Wbe the set of all 4 
i) w, 
> Wi20,forj=1.2 .. 
a1 > Nn. 
Pot et... 4). n. 
More gen, ; + Wrain < Gq, fort = oo, °° * Pr 
1b and 25 erally, if the dual nee of e™ is 
» then the & ae Problems are taken to be the general versions ” ch! 
ric Problem is to find a) a U and w® in 


t 


a 4 (0) 
ee De > bj. 
re Se j 
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Find those W belonging to W such that th 


byw, + bow + 


is a Maximum. 
The symmetric problem. To find those pairs 
to U and w belongs to W such that, 


oy) + cote + °° 1 omlm = byw, + bewe + °° 1 Onn: 


Remarks. (1) The maximization, minimization, and symmetric prob- 
lems stated here are obvious generalizations of the problems encountered 
in the previous section. 

(2) The “diet problem” discussed in section 3 of Chapter 2 is in the form 
of the minimization problem. In that example, the following interpreta- 
tions were made: a;; is the amount of nutrient 7 per unit amount of food 2; 
b; is the minimum amount of nutrient j required; and c; is the cost of a unit 
amount of food i. A “‘diet” is an m-tuple u = (uy, * * * 5 Um), where u; is 

the number of units of food i in the “diet.” 
(3) For given data b, ¢, [a;;] it can happen either that there are no 
m-tuples u in U or that, although there are m-tuples in U, there is no 


lower bound to the index Y cu; In either case the minimization prob- 


lem has no solution. Similarly, W might be the empty set or there may 


be no upper bound to the index ) jw; when it is non-empty. Again, in 


2 
either case there is no solution to the maximization problem. 


Principal theorem of linear programing. 
1. If there exists a u in U and a w in W, then 


Pee 8 A co, 2 bits tr Ona: 


0 . . 4 
< If (a, w) is a solution to the symmetric problem, then u® isa 
solution to the minimization problem and w°” is a solution to the maxim- 
ation problem. 
0): : Pinte =r : 
< 3. If u™ is a solution to the minimization problem and w is a solu- 
lon to the maximization problem, then 
0 
yu foe + $c = byw{? + > + + + baton” 


n 9 


Le. (yO 0)) ; : ‘ 
- _ , w) is a solution to the symmetric problem. 
- Ifa solution exists to one problem, then soluti i 
ons exist to the 
two bigs ’ other 


5. If bot 
1 aaa h U and W are non-empty, then all three problems have 
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| f. We will outline two different proof: this theore i 
1ark. aa ane theorem, 

i Ren ‘ ‘n small print in the remainder of th n, does n he 
first, given ! for games, but rat! Ot den, 
on the minimax theorem for g , ‘a Na theore, ., “ 
| solyhedral cones due to Farkas. Thus, this self-contain HL. 
i (0) proo es ae CC jn 
| fat it includes both the  -_— Pe duality thn, 
} ; i this proo! c: ed yd 
| Spemeiprogeaming. Incidently, this P | to provide 


ini theorem. In ti Pinaud | 
another proof gerne sminima SECON, a se¢pn, 


j i i li t neralizz iC) 
proof is given which rests cc as gh ‘ge ion of the Rs 
i the intimate rel; 1a 
theorem, It demonstrates clearly h 


{ iC 38 a} j A 
PODSAID between tWo 
person zero-sum games and linear programing. 


——— 


> Proof. 1. This follows from a chain of three inequalities: 


», 8303 < y (9 was) i > y 2:32) ui & » a 
7 j i t j 4 


The first of these arises if we multiply the jth inequality of (ii) in the minimization 
problem by w; and then sum over allj. The middle equality follows from a chang 
in the order of summation. The last inequality arises if we multiply the it 
inequality of (ii) in the maximization problem by u,; and then sum over all. 

2. If (a, w™) is a solution of the symmetric problem, then u\ belong i 
U and w to W. Furthermore, by the inequality of part 1, the index for" 
must be a minimum and the index for w© must be a maximum. Hence! 
and w™) are solutions of their respective problems. 
_ 3and 4. These two assertions are mathematically much deeper than the prectt: 
ing ones, and the proofs are correspondingly more difficult. We shall be conte! 
merely to outline the nature of the proofs, which hinge on the following non-trivi 
lemma (first stated and proved in 1902 by J. Farkas). 

Lemma. Let the following array of numbers be given: 


diy di2 Set Air 
doy d22 a * doy 
dpi dn2 e 28 6 dpr 


dp +1, 1 Gott, 9 O dp+1, r > 
where no row 


Ae are consists entirely of zero elements. If for any r-tupl¢ (p1; P% 


Pidsi + pedip +... > 0 
ir ’ 


: mal 2 : » ps 
it follows that for 2 ) 


Pidy41, 1 + Podn+1, 2 -+- cure + prdp+1, r 2 0, 


then there ex; 
exists qa p-tuple 2 = (Ay, > Ey); such that 


and M20, fori =1,2,--°.- 
ms Audi; + ge 
Be: : so "a FAgdy; = ap + 1, 9) for j = 1 oom 
De: 
7 , isi . 


. 
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Remarks. The lemma asserts that, if whe 
angle with each of the first p row vectors it als 
(p + {)st row vector, then the (p + 1)st row ' 
bination of the first p row vectors. When we 
a non-obtuse angle with d; = (dj1, ° * * , dir), we 


Bidet te. oS A Piss 


Y 


Some geometrical insight into this lemma can be ) 
y= 3andp = 3. Then the three rows can be identified with t ints in 3-s] 
(See Fig. 1.) The row vector da = (d41, d42, d43) is a non-negative linear combina- 


tion of row vectors d; = (d11, di2, di3), d2 = (d21, d22, d23), and d 


dss) if and only if d4 is a point in the polyhedral cone in Fig. i. To illustrate the 


dg = (31, d39. d33) 


d, = (441, di2, dy3) 


Fic. 1 


plausibility of the lemma, it suffices to show that, if dy does not belong to the cone 
generated by the other row vectors, then there exists a vector @ = (01, 2, Ps), 
which forms an obtuse angle with dy (ie., dy > @ = @4101 + @1202 + dasp3 < 0) 
and a non-obtuse angle with dj, do, ds (ie., dy > @ = dei + di202 + di393 > 0 
for i = 1, 2,3). If dg does not belong to the cone, then intuitively it seems clear 
that there is a hyperplane going through the origin which separates dg from the 
cone, where by “‘separates’’ we mean that dy is on one side of the hyperplane and 
the polyhedral cone is on the other side. ‘The deepest mathematical aspect of the 
€mma is the proof of this separation property. Assuming the existence of the 
Separating hyperplane, we now show that its algebraic interpretation yields a 
Proof of the lemma (in this special case). A hyperplane passing through the 
Origin is the locus of points d = (dj, da, ds) such that 


dipi + dops + dsps = 0, 


“e Some suitable 3-tuple @ = (1, O2, Ps). Put in other terms, the hyperplane is 
en locus of all points orthogonal to the vector @, Since dy, dg, and ds lie on one 
ide of the hyperplane and dy lies on the other, the quantities 


d; > @ = diipi + diops + disps, ? = 1, 2,3, 
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are of one sig? and y 
ds: 0 = 441P1 + d42p2 
is of the opposite sign. Thus, the orientation of ¢ 
and d;:@ > 9 ford = 1; 2, 3, a8 was to be shown : - dyg. 
be extended to A 7 SeParat 


roved in general, can 


property is p 
Having paid tribute to the lemma, let us retu: Pies 
parts 3 and 4 of the principal theorem. To this « ee provi 
a solution of the minimization problem, then there exists a Vv oe F a yy 
UCN that 
“a Ab mu < byw” + (0) 
Once this has been established, then we know by par‘ at the equali 
t . ‘ 1e uz was 
hold, and so, by part 2, w) must solve the maximization ance ItY sign mus 
Consider the following array: ae 
0 a pe a Um i, 
ZA 1 0 7 > 6. 
zy 0 1 0 
ee 0 
0 
a 0 0 1 | 
2% 0 0 
m+1 0 
wi? a11 
wo at a Ami —51 
: = 422 * a Qm2 —5e 
wo 
—— ain @2n ees: Amn —bn 
C1 C —— 
The quanti ; = - a 
antity mu is defi 
aolnags , ned to b (0) +f 
gen of the minimization Ein + cous + + + + +cmu{t?, where y isa 
fe, Par the @ vector in the 1 em. The vector (u1, st 5 Um; 2) will ply 
1, Ca, °° emma, the number m + 1 + n the role of ps 2% 
st show 


that, i "5m; —4) is the 
bit Qs, ++ + tn, 2) ae + 1)st row. To apply the lemma, we ™ 
a non-obtuse angle with each of the first m + he 


TOW vector: ° 

8, it also fi 
row) O orms a non 
: nce this j -obtuse an 2 : 
numbers 2  . 1s 1s shown, the lem gle with the last row (i.e., the 
o> a ma establishes the existence of non-ne 


> “m41) W we 
ai VS 4 wo Sieh that: 


gat 
(1) 2f% 
1 + wi(% ; 
(2) zy wi ee tte iain = 61 
5 A j] @21 + bite <a a warn =n 


(i) 2(% 
mets + wa, 
; ee es + wan = 


Cm 


\ 


14 2 
mi + + wo amn 


0) 
m+1 (w(5, + Bees eetPd,) = —p 
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But, since 26 2 0, the ith equation gives 
2 


I (0) 


wai + ee ar 


; 0 BIN Ben atin sneld 
and since 2, 2 0, the (m + 1)st equation yield 


or ais p60) 4 (- 
wibi =i +w n bn 2 ob KS 


But this means that w = (wi, w, - 
tion problem, as was to be shown! 

To finish off the job, we still must show that, if (wi, °° ° » Ym 
obtuse angle with the first m + 1 + 1 rows, LG.; 


, z) forms a non- 


(i) uj 2 9, Tome alee, 92" *) 5 M5 
z20; 
(ii) uiaig + °°) Th Umamj — zb; > 0, forj = 1,2,° °° 4m 


then (ui, °° , Um, z) forms a non-obtuse angle with the last row, i.e., 


(iii) uyituxe+t +tm6m — zh 2 0. 


We will consider two cases, namely, z > 0 and z = 0. 


Greg (z> 0). If (ur, °° -° 5 Um, 2) satisfies inequalities (i) and (ii), then 
(u;/z, u2/z, °° * , Um/z) belongs to U and therefore 


uj u2 Um 
eid 62 + ae at +—cm —» 2 0, 
Zz Zz 


since p is defined as the minimum of the u indices. 
multiplying the last inequality by z. 
Case 2(z = 0). Suppose (ui, * * * , Um, 0) is such that 


Statement (iii) follows by 


u; 2 0, fOr tealeaene tn, 
and 


ujayj + °° * + .Umamj 2 9, for we 1, 2. ceo tts 
then we must show 


uyjC1 + ese + Umm 2 0. 
ne , : ee nee 
If u = (4, - - - | 4) is a solution of the minimization problem, then we 


assert u* = (4) + dui, uf? + Au, °°", u\ + Aum) belongs to U for all 
A 2 0, since: 


eae hu. > 0; fori. = 1, 2.7): 


> m, 
and 


(6) (uf + rurar; + (wW + Aua)aay + °° + (CU? + Aum)oms 
{uar; + ua; + °° tuPams} + (A@uraiz + + + + + Umams)} 


+ uam; (since the second bracketed expression 2 0 


% by hypothesis) 
2 b;, forg/ = 15,2; 9: 75 n. 


> y/ 
eo ay: 


= 
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the index of u* must be at least p, so 


Furthermore, 4 | 
a a \ 
ns GO + Ane + + (us + Aum 
; (0) , 
ia F tin em} (A(uy T UmCm) | 
TS as tlm)» 
Hence, it follows that : 
| uycr to! Tt Umlm 2 
as was to be shown. sy ;. 
Summarizing, we have shown that, 1 a is a solution to a 
problem, then a solution w‘” to the maximization problem exists and - 
— 0 oe e@ l 4 
7 + cmb = byw + + baw'® 


In a parallel fashion we can show that, if w‘ is a solution to the maximizatio, 
problem, then a solution u) to the minimization problem exists. This estab 
lishes 3 and 4. 

5. Part 1 asserts that, if u belongs to U and w to VW, 


> en 2 Y oxes 
, j 


pee, the set of numbers 1) cuit, for u in U, is bounded from below. Let yb: 


t 
the greatest lower bound of these numbers. We wish to show that there exists 4 


u in U such that 
> eng = yw. 


t 


Instead, we shall show there Beets in W such that 


3 bjwi = p, 
7 


which wi 

4, the poe that the maximization problem has a solution; and threo 

such a w() ig Petia symmetric problems have solutions. The nee’ of} 

and 4. The ietias by making some minor modifications in the Pr at 

Proof of 3 and 4 remai reader should check to see that all steps encounter 
main valid except for the following points: 


i. wi 
1s not defined as cui 


that such a u(0) 


p 
s ‘ ards kno" 
since in the present proof it is not initially 


exists, 


t 
In : er bov 
eu; for all u in U stead, is defined to be the greatest low 

7 . 
li. In the 
roof fi 
where u( j 5 Or the case z = wt = 2 ght 
u’ belongs 18 a solution 0, we can no longer take 


to Cs Ce @ 
to U and the minimization problem. Rather, w°™ 


BS) cau! <n +e 


ry 
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: ee Be T Ths 
for some preassigned positive €, however sma Ch 


wpoOpte+a 


But since € is arbitrarily small, we can conclude tl 


T° 


Ly UjzCy 2 0, 
t 


which was to be shown in that part of the proof. 


This concludes the first proof of the principal theorem. < 


A5.3 REDUCTION OF A LINEAR-PROGRAMING 
PROBLEM TO A GAME 


We have a dual aim in this section: First, as the heading advertises, we 
will show how a linear-programing problem can be reduced to a game. 
Second, we will prove the principal theorem of linear programing by 
means of the minimax theorem. 

Consider any linear-programing problem of the minimizing or maxi- 
mizing variety described earlier. We shall now exhibit a two-person 
zero-sum game whose solutions provide solutions to the linear-programing 
problems, provided solutions exist at all. The appropriate game matrix 


1s: 

By Be ‘eee Bn Bn+1 Bn+2 Bn+m Bn4m+1 
ay 0 0 See 0) gh ae OT wees Crs by 
a2 0 0 hee 0) —a\2 —4d422 ses el alan bo 
Qa 

n 0 0 0 inno, -«.% —Aun Ox 
On+1 a\\ a\2 Qin 0 0 0 Set 
On+9 a21 a22 a2n 0 0 0 =€9 
Qa 
ie aml Lente «ses ann 0) 0 Ariens 0 (055 

n+m+1|—by —bo Pn OR C1 C2 Bite Cm 0 


re the game matrix is skew symmetric, one conjectures that the 
“aa € game must be zEr0. This is easily shown: if both players use 
ia: mixed strategies (i.e., put the same probability weight on their 

Pure strategy for j = 1, 2, - - - , 2 -+m-+ 1) the payoff to each is 


¢ n Z -ro-§ (Newen 
Linear Programing eee = erson Xe =e 


420 ‘ _ [A5,3 
. egy which will gt — 
zero. hus, there is no strategy tla post, 
eturn. i : _— 
r Since the value is zero, the mixed strategy 
é (0) 7 (0) ae NZ 

(2B. EC , 2h” Bis oye > -n Bns Zn+1Pn-4 LiPn4i, * + . 

) *n-+mt Bn ma 

. ae s Ee ee 
ss minimax for player 9 if and only if it hold: 1 down to 0, thy 
is, if and only if 
rm+1 
‘ \ / 
AY) > 0, aie jn tm + i, 2 2) = 7 
k=1 
and 
é 0) a i tee (0) ) 
@ (eng +--+ + enya + $29) ting] + 2 nash 
“0,7 = 1, 28 

e bs af 
(ii) [ay tere Ht 2 a4; es - zi Ma, | = 20) iti < 0, 

2 hye 
(iii) —[2(by “3 “ie at rd — Pesci os ae sa Ze akon £0. 


The principal theorem of linear programing, as we have stated it, con 
tains five assertions. The first two are easy and we will assume them 
proved (cf. the proofs on p. 414). In establishing the other three as 
tions, there are two cases to consider. 

Case 1. There exists a minimax strategy for player 2 with rt #s 
Dividing each inequality of (i), (ii), and (iii) by 2{°),41 and denotne 


(0) /,(0) : 
Znti/2n4m+1 by a, p= 1,2,°°°»™; 
(0) /,(0) 
2j ert by o”, fopgpeee 1,2,°°° 5% 
we find that 
(0) 
u = (1,60) (0 
(ur, us, - , u\)) belongs to U, 
(0) 
wo = SO).-.,(0 
and ag, 26°), - , w°) belongs to W, 
(0) 
wy b Cea 0) 
a ae +w, > ue, +--+» + uew is 
M assertion { ing 2” 
of the princi ing * tt 
aPove inequality we en theorem of linear progta™ : e ym 


WY and lude that (u, w) is a solution ° fe g 
and t : : eor 
ey % herefore by assertion 2 of this same tb ole 


3 € soluti at 
respectively, lutions of the Maximizing and minimizing 
Case 2 
Bop BPC does not oxi ‘2 layer * 
"m1 >0. For this exist a minimax strategy for P 
(a) Either or ea at will first show three things 
Pty. 


) wi? 


—  ~— 
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(b) If W is non-empty, then the index 
wb + u obo =e 


where W is in W, can be made arbitr 


problem has no solution). 
(<) If Uis non-empty, then the index 


uycy + U2 + 


where u is in U, can be made arbitrarily small 
problem has no solution). 


(1 ea ti 17 ITs. on 
\1.C., tit Piisiiis iZati , 


Once a, }, and ¢, are demonstrated, the three remaining parts of the 
principal theorem follow easily. For, if U and W are both non-empty, Or, 
if solutions exist to either the maximizing or minimizir 
case 2 does not hold; but when case 1 holds there is a minimax solution of 
the game which yields solutions to all three versions of the linear-pro- 
graming problem. 


Before we establish assertions a, b, and ¢ we will prove three preliminary 
remarks which are valid for case 2. 

i. The crux of the proof depends upon the following assertion about the 
game with the payoff matrix given on p. 419. If all minimax strategies of 
player 2 yield a return of exactly zero against Onim+1, then player 1 has a 
maximin strategy which puts positive weight on an4m+1- But, by the 
symmetry of the problem, this would mean that player 2 has a minimax 
strategy which puts positive weight on Bn4m+1 and, under the assumption 
of case 2, this cannot be. Consequently there is a minimax strategy z‘” 
for 2 which gives 1 a return /ess than zero against Q@n4m41- This means 
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m 
> 2 aij 20, for all 
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ere. 48 ber z\° i Sees 
by the inequalities (i) on P. 420 (remem > ASsertion (j 


follows. 
i. If ther 
show . 
» yb; x 0. 


ga 


, Pe i i -emp sit (0) ak 
e exists au’ in U (i.e., U is non-empt for 2 of (i). 


Observe that, 


n 


n ™m ™ v 
0 (0) lh Jt ,(0) 
> z b; < . 23 » ait! ) > u; ») 2; ais), 
1 t=1 t=1 4=1 j 


j= =1 


where the inequality follows from the requirement that w’ is in U and the 
equality follows from a summation interchange. Since 


n 


> ai; £0, for alli, 


i= 


by the inequalities (ii) on p. 420 (remember 2(°} ,,,., = 0) assertion I! 
follows. 
Now back to assertions a, b, and c. 
_Assertion a, that U and W cannot both be non-empty, follows because 
(ii) and (iii) are then in contradiction. 
Assertion b, that even if W is non-empty no maximum exists, is proved 
i follows [note: The z‘ used in this proof is the same z(° used in Pr” 
minary remarks, (i), (ii), and (iii)]: 


If w’ = / 
= Ww : Ey. ok Ce 
(wy', Tot i) , Wn’) lies in W, then so does 


U (0) 
(w;’ + VS ; w,! aie at”, ee wy! + zi) for A > 0, 
since 
os (w,’ ( . . 
ijlWw; (U))\ eae = bj 
for alli. B : a oy 
ut the index for this point is 
n 
S: b;(w,’ (0 . : 
4 i j + Az} ) = » b jw! a> () p26), 
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and since by (ii) of Pp. 421, 


j=1 
this index can be made arbitrarily large by making A large enough 
Assertion c, that even if U is non-empty no minimum exists, 1s pro ed 
similarly. It requires the dual of assertion (ii), namely - If there exists a U 
in U then 
™ 
SOs ; 
», En+it t < 0. 
i=l 


This completes the demonstration. 


A much simpler reduction of the linear-programing problem to a game 
can be given provided all the components of b, c, and the matrix [a,;] are 
positive. In the minimization problem, given on p. 412, make the change 
of variables u,’ = cu; (i = 1, 2,° °°, m) and a,,’ = a;;/(b;c;). The 
problem then reduces to player 1’s problem on p. 409. Observe that the 


inequality 

> u;aij = b; 

A 
becomes, on division by ),, 

u,;'a;;' 2 t. 

Similarly the maximization problem on p. 412 reduces to player 2’s prob- 
lem on p. 410 if we let w,;’ = b;w; and a;,’ = a;;/(b;c;). ‘Thus we are led 
to the study of the game {A, B, M’} where 


A= {a1, cat ae Gig tn ie = {Bi, eae Poet M' (au, B;) wa aij’ a a;;/(b;¢;). 


Tf x = (x0). . - | x) is maximin for player 1, Py) Sig = 


Reet is 4 ia ; ; 
y\) is minimax for 2, and if v is the value of this game, then u‘? = (u6, 
- , u\), where 


us = x /(cn), i= 1,2, +574 m, 
is a solution of the maximization problem; and wi mace zak), 
where 
wo = yf /(b;2), i= pe Be Ge 
sa solution of the minimization problem. 
1 a this case the equivalent game problem is m by n instead of (m an nes 
y (m+n+ 1). However, as we shall see in the next appendix, for 


certai ’ ee - 
‘rtain computational procedures, it is advantageous to have the game in 
Symmetric form. 
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SoLvIN G TWO-PERSON 
ZERO-SUM GAMES 


A6.1 INTRODUCTION 


(with mind! 


To render this appendix relatively self-contained, we repeat (see 


k 
modifications) some observations made earlier in the body of the be 
section 4.12). 


ymbel of 


Z ite 1 
> Now that we know that all two-person zero-sum games with a finit e soll 


0 ) 

Pure strategies have solutions, our attention turns to methods of finding i ethos 
Hons. Here, at best, the story is quite discouraging. Although Pace gonoul! 
are known for solving games, these algorithms usually require a fantas conic! 
of work, at least for games which purport to be realistic replicas of — apulo” 
of interest. The realism is achieved only at the expense of introducing © 4 late 
number of pure Strategies. One might hope that cases involving SU : refi! 
number of strategies could be idealized by a continuous model and art 
hods could be brought to bear on the idealization. xt appt 
Wy nal! see in our discussion of infinite games in the Pay 
all honesty, we must admit that the number of existing er » de 
in € cases is small, and even in examples that have their ca ate 1) 

cf. dj ite case the usual hope is to reduce them tO ue een ip 

© tee wt. polynomial and polynomial-like games, which 2 
First, Settee features in the solution of many a of strateB pat 
use the oe 4 game may involve a huge num e| down © 
. cal context to help reduce the ™ 


424 


However, in 
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A6.2] 
essentials by discarding many of the inadm 
of the game often leads one to shrewd gu 
cedures, about intelligent starting points 
We believe, and probably most of ou 
tant and interesting games will never be solve 
theory will never contribute anything to the eali 
operandi for a complicated case is to consider an auxiliary gam 1 is 
and related to the original one in such a way that many of the important 


nomena of the original are retained while the auxiliary remains solvable. Fro 
the solution of the auxiliary game one speculates informally how the results are 
modified in the original game. Thus, for example, there are simplified variants 


of both poker and bridge in the literature. Such studies are in much the same 
spirit as economic analyses of idealized Robinson Crusoe or Swiss Family Robinson 
economies which, by means of a lot of hand waving, are used ‘“‘to explain” eco- 
nomic phenomena and to reach policy decisions concerning the economy at large. 
This is dangerous, yes! Yet it is quite stimulating to our creative intuitions and 
often helpful in purely literary, pseudological (not said deprecatingly, but rather 
pragmatically) theorizing. 4 


A6.2 TRIAL AND ERROR 


The most common method of solving games which arise in practice is to 
guess at the solution and then to check that the proposed strategies are in 
equilibrium. Since a pair of strategies provides a solution to the problem, 
if and only if they are in equilibrium, the method is foolproof—provided, of 
course, that one either can guess phenomenonally well or is undaunted by 
failure. When one comes across a statement to the effect that “ .. . if 
we try the following pair of strategies, we see they solve the game,” gen- 
erally it is not known whether the solution was arrived at by brilliant 
mathematical insight or by sheer hack work. Usually, it is a combination 
of the two. 

There are a few guide posts, however, for one who indulges in this guess- 
ing game. Let us suppose that (x, y, v) represents a solution to a 
Siven game (x is player 1’s maximin strategy, y‘ is player 2’s minimax 
Strategy, and v is the value of the game). Since M(x‘, 8;) 2 »v, for 
Pane >, n, the strategy yi can utilize with positive probability 
Only those 6; for which M(x, 8;) =v. For if y“ uses a 8; with posi- 
tive probability for which M(x, 8;) > v, then M (x,y) would have 
to be greater than v, which contradicts the optimality of y‘”.! Hence, 
ify uses each B;,7 = 1, 2, - + + , 2, with positive probability, then of 
necessity x must have the property that M(x, 8;) = »v, for j = 1, 2, 
Tasso: Tn many games, y‘”) uses each 6; with positive probability, so it 

‘ This follows from the fact that M(x , y) is an average of the numbers M4 (x, B;), 


ao are not less than v. So if any of them is greater than v and if it is weighted 
lively, then the weighted average must also be greater than v. 
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is usually d upon j (Such an x* can be found Iving g 95) dy 
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not depene UP The procedures for fi Ysten 
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: s equ 3 r 
simultaneou : ‘ 
qualizes a player’s expected payoff over all of Ponent’s st, 1 
e ee _ ®t ~TEDCS strates 

; eniths in some moc _— gj 
have been carried to artistic 2 ork on sa 


7 * 4 at Af ) e 6 
Ae is found such that . 2. 1 
decision theory. Even if an x jis independe, 


of j, one must still verify that x* is os aeeae * 4 pds Sha 
that x* is player 1’s best response fe? 8 foolproof check, sin 
if x* equalizes the possible Eenurns to player “al then certainly y* ig mn 
against x* (as are all strategies for player 2). ‘Thus the pair is in eauil, 


rjum since x* and y* are good against each other. How to find y* islet 
as an exercise in mathematical ingenuity only for the expert and the 


lucky. 


A6.3 CHECKING ALL CRITICAL POINTS 


In Appendix 3, we introduced a geometrical model which is useful in 
solving games. There we considered diagrams such as those shown i 
Figs. 1 and 2. We are mostly confronted with games where player | ha 


(1, 0) x (1) x (0) (0, 1) 
Ic. 1, Diagram for a game where 


A= 
{a1, a2} and B = {81, Be, Bs, Ba, Bs}. 


more than three 
a two- 
ient to 


x ving 
egies, SO we cannot usually depend upoP we 
nal pictorial guide. Nevertheless, 1* ® 

re about in terms of these simple cas¢ ont 


Pure strat 


Ygon. “6s 
ae San composed of pieces of “lines” (PM 3" 
ya bounding (vertical or m > 3), each generated either y oint é 
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Checking All Critical Pot 
A6.3] , 
this minimum function is characterized by 
cay, where X Is a m-tuple (x1, x9, 
and where z is the height of the point from 
points (x, z) of this minimum surface for whic] 
which z = 2) can be “visualized” as a convex p 
which in turn is characterized by its extreme 


(1, 0, 0) 


Fic. 2. Diagram for a game where 


A = {aj, a2, a3} and B = {f3, B2}. 


For purposes of clarity, the front face (xz = 0) of the game cylinder has been removed. 


set shown in Fig. 3 is a convex polyhedron with extreme points, a, b, c, d, 
€,f. We also note that, when m = 2, an extreme point arises as the inter- 
section of two lines; when m = 3, it arises as the intersection of 3 planes; 
and in general it arises as the intersection of m hyperplanes. Hence, a 
Plan of attack is to find all the points, known as critical points, which 
arise from an intersection of any m hyperplanes. (Note: We also have to 
Consider the bounding hyperplanes. Thus, if player 1 has m strategies and 
Player 2 has n strategies, there is one hyperplane for each pure strategy of 
Player 2 and m bounding hyperplanes, or a total of m + n hyperplanes.) 
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Ag 

ano the critical point associated with any m hy an , 
Finding t olving m linear equations, and thi diet at 
equivalent ee trix of order m. Once we havé 7 ort i 
ing a . many of them as intersectior field oo 
we if a above)—that is, one or mor‘ ache rs 
(8. ‘i ad of the point are negative. Kel = an i 


f e 
Fic. 3 


the critical point ¢ has coordinates x" and », and that M(x", 6)) 71 
for all j. Similarly, the critical point b, with coordinates (x), o°), is 
such that M(x‘, 6;) 2 v*, for allj. In this manner, we can verify that 
only the points of intersection a, b, c, d, and e are on the minimum fu 
tion, and by enumeration we see that c, i.e., (x‘”, v), gives player Is 
maximin strategy and the value of the game. 
Kaplansky [1945], who made the first formal contribution to finding 
solutions of zero-sum games, presented an inductive procedure © dele 
mine the value of a game in a finite number of steps. Later Shapley 


! 
| 
| 
| 


XO, 1, 0) 


a. (0, 0, 1) 


Fic. 4 


Snow ons 

all ah ine E Ponstructive procedure to determine all na 

of this ea Maximin strategies) of a game; for a very ae m7 

but this Case il] pees Kuba [1952]. We shall describe it only ae 

game and oO eae the basic idea. Suppose that y! ist . tio8s 

Fig. 4). Thus x, 96) : 2”, 0) is — ee ‘a 
$ a critical point on the boun 


ey 
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A6.3] Checking All 

xg = 0. Restricting ourselves to this bounding 

(x0, x); y) is the intersection of two §; planes, 
(x6, wh, y)) is the unique solution of the syste: 


x1 + X2 
Ifa solution exists to this system, it can be simply expressed in terms ot the 
determinant 
la1j, a2;1 | 
Glia M2584 


and its cofactors. 

To take another example, an extreme minimax strategy sg) eee 
(x), x, x§), where xi? > 0, i = 1, 2, and 3, is detected as part of the 
unique solution to the equations 


15,1 + 42;5,x%2 + a3;,x3 = 0 
1j2%1 1 425.X%2 1 4352%3 = U 
A154%1 + Goj,%2 + 3;.%3 = U 

x + x2 + x3 = 1 


for suitable indices 71, j2, and j3 from the set {1, 2, - + - , n} of indices. 
Again, the unique solution (if it exists) to this system for any specific 
1; J2, jg can be expressed in terms of the determinant 


415, 427, 437, 
@1j, 42j2 F3j2 | 
Qj, 2273 4353 | 


and its cofactors. The extreme minimax strategies y‘° are found in a 
similar way. 

The algorithm consists, therefore, of isolating a// square submatrices of 
the payoff matrix [a;;] and computing for each such submatrix the potential 
xtreme maximin strategy, extreme minimax strategy, and value. All of 
these are expressible in terms of the determinant and cofactors of the sub- 
Matrices. From the set of potential candidates it is then a simple matter 
to check which fulfill the prerequisite equilibrium requirements for a 
Solution. 
forse hapley-Snow procedure also applies to linear programs; see, for 

€, Goldman and Tucker [1956]. However, this procedure is not 


cu i i 
eas competitive as a computational algorithm, even though it is of 
treme technical interest. 
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A6.4 THE DOUBLE DESCRIPTION METHOD 
i 

Motzkin, Raiffa, Thompson, and Thrall [1953] have suggested a gy 

utational method, called the double description method, for determin, 

7 en - lg 

path the value and all the solutions of a two-person zero-sum game wir, 

finite number of pure strategies. Their proced re is also applicable 

\inear-programing problems. In explaining these results, we shall use ty 


geometry discussed in Appendix a. a 

In the double description method the minimum function is built up by 
introducing the hyperplanes associated with the 6,’s (called 8; planes) one 
at a time. First, we consider the minimum function, called the 8, 8, 
minimum function, generated by the @; plane and Be plane. Next, we 


introduce the 83 plane and thus generate the A, 82, 83 minimum function, 


(By, Bo, B3, By, Bs) e 
a minimum function 
(1, 0) (0, 1) 
Fic. 5 


We continue in this fashion until the 61, 82, * °° > Bn minimum functe! 
is generated, from which the maximin strategy and the value of the ga" 
can be read off. 

To be specific, consider a case where A = {a1, @2}- Suppose Wr 
carried the procedure to the point where the (1, 82, °°" ’ Bs mini 
ae is known, and suppose that it is the fanction show? ! 
a oa og ee ncton is, therefore, characterized * 
First, o a on oA we introduce the fg plane (actually, fine 1 aw tbe 
critical points 2 eed “Sah pene es OM — * vt 
lies above all the eae es. a a — bg ¥ 

a, points, then it is not a part of the P I poll! 


minimum f . ica 
unction. However, if it lies below at least 0P° cri pe 


of the 8, - 

y ee se nu 

integral ait of minimum function, then the Bs plane Hent® pt 
€ B1,°--, Bs minimum functior st pe 2 


critical points must be introduc 


m r 
Car ed old one a su 
ded to chara: t ; F and at least one fic 


cl J 

Pose that the 8. plane i minimum function. T° be $P cit 
Point g is een. ane is above b and below c, as in Fig: > 

uced on the line segment joining b a7? © 


. t ‘ tf 
will certai ; s0 jo 
inly be discarded. To find the exact location © t 


e ne 
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we merely find where the line segment joining b and c pierces the f¢ plane. 
This simple calculation can be easily mechanized, even 

dimensions. 

There is, however, one non-trivial complication. Suppose for example, 
that the Bs plane is above b and below d. The point where the sine s¢g- 
ment connecting b and d intersects the 8, plane is extraneous, since tat 
ine segment is not a part of the 61, -- * , 6; minimum function. In 
Fig. 5 the 61, °°"; 8; minimum function is characterized by points 
a, b, c, d, and e, whereas the 81, - °° , 8g minimum function is character- 


ized by points a, b, g,h, ande. For A = {a, a}, it is extremely simple 
to check whether the line segment joining two critical points of the mini- 
mum function is in fact a part of the minimum function. Such pairs of 
critical points are said to be adjacent. But, if player 1 hasa large number 
of pure strategies, there is no picture to help visualize the procedure, and 
so it is difficult to know whether or not two critical points of a minimum 
function are adjacent. In m-space, two critical points are adjacent if and 
only if there are m — 1 planes common to their characterizations. 

This difficulty can be overcome by a bookkeeping scheme. To each 
critical point of the minimum function (for the stage under consideration) 
associate not only its coordinates, but also the 6; planes which pass 
through the point (and thus implicitly characterize it). This double 
description of the point (coordinates plus a recording of the planes which 
define it) enables one to keep track of which critical points are adjacent, 
and so precludes introducing false critical points. 

To illustrate the next point specifically, let us suppose that player 2 has 
ten pure strategies, that we have determined the critical points of the 
81, 82, - - - , 8s minimum function, and that the critical point (x*, v*) is 
highest on this minimum function. We can check whether any of the 
planes 85 to Bio lie below (x*, v*). If not, then x* is maximin and v* is 
the value of the game. So, in general, zt is not always necessary to compute 
the entire minimum function to find the maximin strategy and the value of the game. 

This raises another important issue: since the labeling of player 2’s pure 
strategies is irrelevant, in what order shall we consider the planes in this 
sequential procedure so as to minimize the computational effort? One 
Suggestion is this: at any stage find the maximum critical point of the cor- 
responding minimum function and introduce the 8; plane which is fur- 
thest below it. This plane is easy to ascertain, and, if there is no plane 
below the maximum critical point, we might as well stop for we have what 
We are looking for. 

For some purposes it may be extremely advantageous to know, in whole 
= a Part, the final minimum function. For example, suppose x is 

€ unique maximin strategy of a game with value v, and suppose it has 
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the special property that M(x“, y) =e forally. Thus, xt 
iti h 1 qual; 
lever 2’s opportunities, and so, if play er 1 uses € will Re 
i, when player 2 does not play minimax. rf mies ee 
player 2 is not going to play a minimax strates hevate St 
more than v. This is only possible, however, i! ds playing ia 


min. But player 1, although willing to take 
may not want to expose himself to excessive 
a safe value v* < v and decide 


s¢l More than 
41 


Or example, by 


might set up . it <8 stupidity o 
non-conformity only to the extent of using strateg: uch have a secur 
level of at least v*. To do this, 1 can refer to his minimum function 
determine the set of x values for which the minimum function js above pt 
his deliberations will then be confined to that set. If player 2 ig no 


os 


strictly opposing player, setting up a security level v* < v might be quite 
realistic. See the discussion in Chapter 13 of the Hodges-Lehmann cy, 
terion and also Hodges and Lehmann [1952]. 

Given the minimum function of a game, which presents an analysis 
from player 1’s point of view, techniques are known which shorten con. 
siderably the parallel analysis for player 2. 


A6.5 THE SIMPLEX METHOD 


The simplex method is a computational technique, devised by Dantzig 
[1951 6], to solve linear-programing problems. Since two-person 2010: 
sum games can be reduced to programing problems (of a very spel! 
form), the simplex method also yields a computational procedure for 
solving games. 


The Simplex problem. Wet U’ be the set of all m-tuples u = (M»" 
"+ , Um) such that 


() 4:20, fori=1,2,---,m. 
(ii) W141; + uray; + Ook ae a Und; = b;, for j - Ae toh , 


Find those u belonging to U’ which minimize the index 


eit) + Mas + + CmUm: 


i f ob 
8 a Stl os is a slight modification of the minimization lg 
are ca b a2. The inequalities (ii) of the minimizatio? a im 
is restriction Y €xact equalities. At first glance, one might ‘s cas 
an inequality Would result in a loss of generality, but this is ei ogucl™ 
dummy varj aoe always be changed into an equality by & 
Beek For example, the inequality 


ujay; + Ns i + thn Omj 2 bj, 


46.5] The Simpl 
is changed to an equality by introducing t 
where 
ujQ1j; + s+ + 4,0 

A simplex problem also has its dual ma» 
no longer a nice symmetry between the dual pre 

The dual of the simplex problem. Let W’ be 

- , Wn) such that: 


W14j1 Wot + °° * tt Wn@in S 6:3 fori = 1 


: : yf : ° . e : E 
Find those w belonging to W’ which maximize the index 
154 at.” - “Wala, 


Note that the dual problem does not require the w;’s to be non-negative! 
Also, the m constraints are given by inequalities, not by equalities. 
The duality theorem for the simplex problem asserts: A solution of the 


simplex problem exists if and only if a solution to its dual problem exists, in 
which case, 


™ nr 


. r 
min cu; = ‘max > b jv}. 
win U! 2) win aa 1 


If u belongs to U’, we shall say that u is a feasible solution. If u = 
(uy, U2, * * * , um) is such that u; > 0, then we shall say that u uses coor- 
dinate i. Finally, u will be called a basic feasible solution if u belongs to 
U’ and if u uses at most n coordinates, where n refers to the n equations 
of (ii) in the statement of the simplex problem. It can be shown that: 

1. If a feasible solution exists (which it certainly does for the linear- 
programing problem derived from a zero-sum game, but need not in gen- 
eral), then a basic feasible solution exists. 

2. If a solution to the simplex problem exists, that is, if there is a 

minimal feasible solution (cf. the parenthetical remark in 1), then a mini- 
mal feasible solution exists which is also basic. 
_ One final definition: two basic feasible solutions are said to be adjacent 
if there are n — 1 coordinates which they both use. Hence, if a basic 
feasible solution is modified by eliminating one used coordinate and by 
using a new coordinate, then the modification is a basic feasible solution 
adjacent to the original solution. 

Two assumptions will be made about the simplex problem. The first, 
known as a non-degeneracy assumption,” implies among other things that 


e * The non-degeneracy assumption actually asserts that all n by x submatrices of the 
eee coefficient matrix (n by m + 1) of eqs. (ii) are non-singular. If this assump- 
©n is not fulfilled, mathematical difficulties are encountered in the formal proofs 


ee may be surmounted by perturbating the coefficients slightly so that the assump- 
1s met. 
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solution with a smaller index—provided one ¢ 
} 


3, If no basic feasible solution adjacent to a basic feasible solution y 
has a smaller index than u, then uy‘ is a minimal basic feasible soly; 
In other words, a local minimum is always a global minimum. 

Knowing this, the procedure is now straightforward. Beginning \ 
any basic feasible solution, we proceed along some path from one adjace 
basic feasible solution to the next in such a manner as to decrease the ind 
at each stage. Since, by the non-degeneracy assumption, there ar¢ ot! 
fnite number of basic feasible solutions, the process must terminate 2 
minimal feasible solution. 

In a linear-programing problem arising from a game problem, ! 
always easy to find a basic feasible solution; however, in a general lint! 
programing problem, finding a basic feasible solution is non-trivial. | 
“Sete by an iterative procedure which leads from a feasible" 

asic feasible solution and which is analogous to the procedure ou" 

“send - a basic to a minimum basic feasible et r 
dure conver a eee wit a fea ae sible s 
et hnse "ee iS depends both upon the initial basic oe “al 
since several .. the particular path taken. The path is ea ; 
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Wo 


programing. There, following Dantzig’s presentation 
simplex technique, Dorfman [1951] illustrates its us: 
game problem. 


A6.6 A GEOMETRIC INTERPRETATION OF THI 
AND DUAL SIMPLEX PROCEDURES 


Lemke [1954] offers neat geometrical insights into Dantzig’s technique, 
as well as into his own variation of the simplex technique. TT! 


Se eae M2 
ineseé Can oe 


Fic. 6 


presented graphically when n = 2, in which case the equalities (ii) of the 
simplex problem take the form: 


Gani + Geitlo 4 + benim = 61 
@j0U1 + dootte + * * > + dmotlm = bo. 


Denote the points (aj1, a2), fori = 1, 2, -- +, m, by a;; then a basic 
leasible solution exists if and only if the point b = (1, b2) is a linear com- 
bination, with non-negative weights of the points aj, a2, °° - 


B “aa; = tone + + + + dam, 


where u; > 0, for i = 1, 2, ---,m. Geometrically, this means that 
a feasible solution exists if and only if the point b lies in the convex poly- 
hedral Cone generated by the points aj, a2, °° *, am. For example, 
feasible solutions exist for the case shown in Fig. 6, but not for that shown in 
Fig. 7. Indeed, in Fig. 6 a basic feasible solution exists which uses only 
ordinates 1 and 2 (i.e., points a; and ag) and another which uses only 
ae 3; however, none exist using only 2 and 3, since b does not belong 

€ cone generated by a2 and a3. 
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Each of the inequalities 


geometry of the dual 


way, + W242 a fy fori = 1 
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can be depicted by drawing the line w1@i1 + % 
Fig. 8. The points (1, we), which satisfy the in 2-space, a. 
below this line. The set of points satisfying t} sae lie on 

equality forms 
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Line: W109) + W2820 = C2 


Line: W1a5) + Wea59 = C5 


as 


Line: W1@3) + W230 = C3 
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wi) is the solution of the dual of the simplex Problem, i.e., there exists 
a@ number y such that 


wp, + w§be = p, 
and 


wb, + wobe < Ms for all (wy, W») in W’. 


We observe in Fig. 9 that the point g is characterized as the intersection 
of the lines associated with a; and a, and that b lies in the cone generated 
Y 41 and a, (i.e., there is a basic feasible solution using coordinates 1 and 

l 4). The other extreme points of W’, h, i, j, and k, do not have the latter 
Property, Specifically, h is characterized by a; and a; but b is not in the 

Cone generated by a; and a;; iis characterized by a2 and a; but b is not in 


nan 
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ee i lated to the duality ti Xtren 
int g of W' is algebraically relate ‘ | 
oin 4 ag 
Pp The point d is characterized by a4 and as, an in the cone. 
erated by a4 and a;. Thus there exists a basic tlotden a be 
and a5, but it is not minimal since d : not in J € also exig ba 
i i i nd a, (characterizin i asl 
feasible solutions using 43 4 me : ing ay and, 
(characterizing f), and using a4 and a (characte! 
The simplex technique might start, for example, . the basic fea 
solution which uses a1 and a3, i.€., with the point e. Since e nea 
belong to W’, it is not minimal. The technique instructs us to ae 


an adjacent basic feasible solution with a lower index, i.e., to either f org 
If we go to g, we are done; if we go to f, then one more step takes us to 
since from f the index is only lowered by moving to g. 

Lemke suggests a variation of the simplex technique, known as th 
“dual simplex method,” which involves moving from one extreme point 
of W’ to another in a manner that increases the index at each stage. Fu 
example, suppose we begain at the extreme point j of W’. The pointji 

pie, supp: g P J point j 
not a solution since the cone generated by ag and az (which characterize 
j) does not contain b. We move from j in a direction perpendicula 
either to a2 or a3 until we come to the next adjacent extreme point—eithe 
1ork. (Inm-space, we move in a direction perpendicular to m — 1 ofth 
normals characterizing the extreme point.) Suppose we move to k; the 
we can go either to j or g. Since we have just come from j, we would 
Surely go tog. But suppose we had begun at k; then we would want! 
. uM to go to g, not toj. This is done as follows: Change thes 
oron sie . ° ° 1c] 
ae € of the normals characterizing the point, in this case a3, so that pisint 

© generated by —a3 and ay. Then, move in the direction orthos™ 

to the normal whose si = We se 
that iki gn was not changed, i.e., orthogonal to a4. ; 
at this takes us to g. Hadweb h Pe ha cone of — as all 
Bae e sane ont, een ath, then b is in the cone 0° oe 
the ve orthogonally to a; and again move tog. Inmdime 
Process is similar, If Was ; the norma 
characterizing our po; ee ome generated by ion. 
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not, changj oad Pome, we have a minimal basic feasible solu"? ’ 
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when actually computing the solutions of specific games. The 


could be ‘mechanized’ with relative ease, both for ‘digitai’ an 
‘analogy methods.” (Brown and von Neumann [1950] p. 73.) 

Their procedure applies to games which are symmetric in the 
that the two players have the same number m of pure strategies and 
M(a;, B;) = —M(a;, 8;), fori,j = 1,2, ---,m. Although the method 
is not directly applicable to non-symmetric games, it would be if we could 


symmetrize them, and then interpret the Brown-Neumann solution of 
the symmetrized games in terms of the original games. Hence, after dis- 
cussing the procedure for symmetric games, we shall describe two pro- 
cedures for symmetrizing other games. 

The value of a symmetric game is zero, for, if player 1 had a strategy 
guaranteeing at least v > 0, then by adopting the corresponding strategy, 
player 2 could hold 1 down to at most —z, which is a contradiction. 

The Brown and von Neumann procedure goes as follows: Begin with an 
initial (time 0) mixed strategy [y1(0), y2(0), - - * , ym(0)] for player 2. 
If it does not hold player 1 down to at most 0, a “‘force” (to be formalized 
by a system of differential equations) is exerted on the strategy which 
tends to bring it “closer” to equilibrium. Thus we conceive of a continu- 
ous time path y(t) = [yi(¢), yo(t), - -- , ym(¢)], where y;(¢) 2 0 and 


yi(t) = 1, of mixed strategies, which will be constrained so as to move 
i 


toward a solution. That is, if v(t) denotes the most that player 1 can 
obtain when player 2 uses y(t), then we want to constrain y(t) so that 

1. v(t) approaches zero as time ¢ increases indefinitely. 

2. If, for any fo, y(to) guarantees that player 1 gets at most zero, then 
y(to) is in equilibrium in the sense that there are no forces acting on 
y(to) to move it. 


Our problem is to set up a differential equation whose solution has such 
properties, 

For the strategy pair [a;, y(t)], let u;(t) = M[a:, y(t)] (= the return to 
Player 1 corresponding to this pair of strategies). We observe that y(t) is 
4 minimax strategy if and only if u;(t) < 0, for i = Eh Di lala a, 
Define $;(t) = max [0, u:(t)]. That is, ,(¢) is 0 if a; gives a non-positive 
return to player 1 when used against y(é) and it is 1’s return when that 
retarn 18 positive. So player 2 wishes to find a strategy y(t) such that 
$i(t) is zero (or very small) for all i. One index of y(t)’s ability to hold 
down Player 1’s payoff is $(t) = ‘ gi(t), for, if (t) is very small, $;(2) is 

i=1 
Small for all i, and conversely. Brown and von Neumann require that 


y(t) move along a path dictated by the following set of differential 
€quations: 


in rections for both 


_ 
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dy:(t) = ¢;(t) — p(t)yi(t), e=1, , mM, 
dt 
They show that: 


(i) x(t) 2 0, for all 7 and all ¢. 


Gi) ) (0) = 1, for all 


t=1 


2 


Gi) dul) < e/(t + 1), where « = ))9.%(0 


Properties (i) and (ii) show that the path y(/) can always be interpret 
as a mixed strategy, and (iii) says, among other things, that by using y( 
player 2can hold 1 down toat most¢/(1 + tc). This quantity approach 
zero as ¢ increases indefinitely. ‘The number « reflects the efficacy of the 
initial guess y(0). The path y(¢) need not necessarily approach a lini 
but may oscillate among limiting points, all of which are minimax strate. 
gies for player 2. Since the game is symmetric, optimal solutions f 
player 1 are identical to those for player 2. 

Bellman [1953] presents a variant of the Brown and von Neuman 
differential equation approach to symmetric games; he claims this yieli 
an exponential rate of convergence, which of course is much faster tha! 
the rate of 1/t of the Brown-von Neumann process; however, a questo 
has been raised and not yet resolved about a crucial step in Bellma"' 
argument. 

H.N. Shapiro informs us that he has also obtained some new resulls 


get a 
Fates of convergence, but these were not available to us in written fom 
the time of writing, 


A6.8 SYMMETRIZATION OF A GAME 
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mann (cf. Brown and von Neumann [1950)): 
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Smith, will be ie Tiption. The opponents, Mr dit 


of 

d the roles of players 1 and 2 in (A, B, M4) ‘yt 

; n: if it turns up heads, Jones plays 1’s role an® ©, i 
symmetrizat; Jones plays 2's role and Smith 1’s role. Th's 5 ot 
ke chess a fair game. Certainly ad i 
” ati? 
ide ° 5 fo! 


Ps the simplest symmetrization of a non-symmetric game 
ue to von Neu 


t of 
Mir. Jones in «ph. YS 1 and 2 in (4, B, M). A pure su” 4) 
% 1 th sy: sey ‘ (ais 
abe % auxiliary Same can be identified with a P4 


oe 


A6.8] Symmetrizati 

pure strategies in (A, B, M). The interpretation is that Jon 
a, if he is player 1 (i. e., if the coin turns up heads) if 
(i.e. if the coin turns up tails). Hence, both Jones 


pure strategies. ‘This symmetrization appears to in: 
number of pure strategies involved; however, | 
showed that the resulting system of mn simultaneou 
reduced to an auxiliary system of only m + n simulkt: 
equations. 

Gale, Kuhn, and Tucker [1950 a] have given a direct symmetrization 
involving only m + n + 1 pure strategies; it is intimately related to the 
reduction of a game to the symmetric form of its associated linear-program- 
ing problem (cf., Appendix 5). This is their procedure: For the given 
game (A, B, M), form player 1’s associated linear-programing problem. 
As shown in Appendix 5 (cf. p. 409), this will be a problem of the minimiz- 
ing variety where the parameters 6;s and ¢,;’s are all1. But associated in 


turn to this linear-programing problem is a two-person zero-sum game 
(cf. p. 419), which takes the form: 


ig Okt a Tog “pepeen aap 1 
= ayo —da99 <iere —Am2 1 
0 
Se pees es @mn | 
ay} aie Ain |} rit 
21 422 Zon | cae 
0 
vat sae aed Gm? amn | 4 
S| —=4 ait ae eae 1 1 0 
Cleari 


j y, the induced game is symmetric, and an optimal strategy for 
either player yields optimal strategies for both players in the original game. 


A verbal interpretation offered by Gale, Kuhn, and Tucker of their 
‘symmetrization is quite interesting. 


hy consider a game in which the players are denoted by white and black. 
= a that white has an advantage (i.e., if white is the first player, v > 0). 

“i 4 symmetrized game is given by the following rules. 
aoe fo choose independently to play white or black, or to hedge. If they 
calouw € rns colors or both hedge, the play isadraw. If they choose different 
he dee Play of the original game ensues. As for the remaining possibilities, a 
Wins One unit from white and loses one unit to black. 
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[ 
That we AB 
It is evident that 


this is a symmetric game. 


‘ne the symmetrized version { : the 0 

‘ginal game by playing ay Oe eat a 
a Eich for the symmetric game oe ine ng both why” 
if ck with positive probability. fee cyclic any Ossibilities (1. 
pine and hedge), reminiscent ea ke this intyjs, 
Slausible. (Gale, Kuhn, and Tucker [1950 al, p. 83. : 
A6.9 ITERATIVE SOLUTION OF GAMES BY FI‘ OUS PLay 

Brown [1951] gives 
a very simple iterative method for approximating to solutions of discrete yer. 
sum games. This method is related to some particular systems of differen; 
equations .. . whose steady-state solutions correspond to solutions of a gay 


The iterative method in question can be loosely characterized by the 
that it rests on the traditional statistician’s philosophy of basing future decisions, 
the relevant past history. Visualize two statisticians, perhaps ignorant of minima 
theory, playing many plays of the same discrete zero-sum game. One migh 
naturally expect a statistician to keep track of the opponent’s past plays and, in tr 
absence of a more sophisticated calculation, perhaps to choose at each play th 
optimal pure strategy against the mixture represented by all the opponent's pa 
plays. [Page 374.] 

Before describing this process in detail, we define the auxiliary notio! 
an empirical mixed strategy. Suppose that, in & iterations of a game, pld}t! 
1 used the pure strategies (a, a”, - - - , a), where a denotes th 
pure strategy he used on the ith iteration. If r denotes the number” 
me pure strategy a; appears in the set (a‘”, a”, - + - , a*)), then le 
x" denote the randomized strategy in which pure strategy occult 
with probability r/k. For example, if in five iterations player 1 “°" 
O25 Oa, M2, 3, then x‘) = (2400, 24a3, Kay). The strategy x is ae 
2g HnVer 1, hh revlts from tn 
y® for player 2 ee, : eecra), The empirical mixt “ses 
B, + -- B®) is g Re fRE Sequence of pure aes 

Peo, ... c ned similarly. - i 
described b ure, which applies to iterations of a fixed § 

y the following steps: 
Step 7, (2) Player 1 
| x) = ga), 
(6) Player 2 chooses a 
(i-e:, against al). 


tions : 
to follow, is ambiguous because of 2° 


Cho 
. Os€ any one of the possible pure strategies comp4 
Step 2. (a) Cy 
- (4) Choose g(2) 
a . 2 : (D), 1, 
(6) Choose g(2) best against y\ (ie., against BY). 4!” 
best against x) corresponding t0 (7 ’ 


(1) he 
chooses a pure strategy which is called « 
(1) r 


inst ® 
pure strategy 6‘) which is best _ ast 
If this instruction, or any 1!" est® 
n-up wit 


. 
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Step 3. (a) Choose a) best against y corresponding to {8", 6 
(b) Choose 8) best against x‘® corresponding to ja’ 
a3), 


Step k. (a) Choose a) best against y“*—» corresponding to {8° 
a}. 
(b) Choose 6“ best against x‘? correspondi: 
a) 


Note that the process possesses only one completely arbitrary choice, 
namely, the starting point a‘. Again, let us denote by v;(x) player 1’s 
security level with the strategy x, and by vo(y) the maximum that player 1 
can get if player 2 uses y. Naturally, for each k 


v(x?) < v < vey), 


where v is the value of the game. Clearly, if lim o1(x™) = lim v2(y“), 
ko k- «© 


this common value must be the value of the game, and so x) and y” 
must be “nearly” optimal for large k. Julia Robinson [1951] showed that 
iterative procedures of this type must converge in the limit, so the Brown 
procedure yields the value zv in the limit. 

In light of these results, finding the approximate value of a game is very 
easy. One merely iterates the game, as we have described, using fictitious 
players, and at each stage the pair of numbers [oi(x™), ve(y)] are 
calculated. Since v lies in between them, the process is terminated when 
the desired degree of accuracy is attained. 

Brown’s results are not only computationally valuable but also quite 
illuminating from a substantive point of view. Imagine a pair of players 
repeating a game over and over again. It is plausible that at every stage 
a player attempts to exploit his knowledge of his opponent’s past moves. 
Even though the game may be too complicated or too nebulous to be sub- 
jected to an adequate initial analysis, experience in repeated plays may 
tend to a statistical equilibrium whose (time) average return is approxi- 
mately equal to the value of the game. 

Note, however, Brown’s procedure requires each player to use pure 
Strategies which are best against the empirical mixed strategy defined for 
all preceding pure strategies of the other player. It is natural, therefore, to 
ask: what happens if the players have only finite memories (i.e., can only 
remember the preceding p moves, say)? For example, if = 1, each 
Player chooses the best strategy against the pure strategy just used by his 
°Pponent. This procedure need not converge, and it is conjectured that 
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mory procedure will w : 
1 will not work, consider the fol Ms 


ork for a 4 
no finite me 
memory of order 


1 
lO & bo hey 


Starting with a1, we get 63, since It Is De 1, then a, 


(Dect 
against B3), then 1, then a1, and the process ah } wever, lou &) 
in equilibrium, and a» 1s uniformly better tha he empirical my 
strategy (4gan, 14a3) which arises from the iterati. € process. 

A related criticism of Brown’s procedure is that it gives the initial pure 
strategies as much weight as the later ones. Since the earlier Strategie 
were not pitted against as much accumulated knowledge of the opponent, 
strategies, it would seem that they should be minimized. What is called 
for is a damped memory—but not so precipitous a damping as a finite 
memory. The discrete analogue of the Bellman differential equation 
method for solving games (mentioned on p. 440), which is obtained if the 
differential equation is replaced by a difference equation, has a damped 
memory effect and furnishes a new iterative algorithm which converge 
faster than Brown’s statistical iterative procedure. The Bellman pr 
cedure is applied directly to symmetric games; therefore other gait 
must be symmeterized first. 

Von Neumann [1954] describes still another numerical method fo! 
solving two-person zero-sum games. Given the matrix [a;;], ¢ = I, h 

Sepeeandy’= 1,2... , N, it is desired to find: (i) an m-tuple x = \"" 


™m 
i 2 %m), x; > 0, for all 7, and > x; = 1; (ii) an n-tuple y = (yu) 


t=1 


"* sIn)s¥5 2 0, for all j, and > yj; = 1; and a value v such that 


j=1 
% 


and 


iv 
(iv) one ool a 
MY tar | 


ae 


x > mM. 


The von Neuman 
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hing (i) and (ii) a eure begins with a trial triplet (x's Y sw) fro? 
tri » and an iterative d : ‘bed for gol fr ch 

Iplet such th Procedure is prescri W 
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satisfies (iii) and (iv). To each triplet (x’, 
assigned to indicate how “close” it is to a so 


om aan 
¢’ = y (v’ — * asx.) * As ) f ' ; _ 
re Pee 7, \ /. Fi } 
j t t 
f . ¥ Soe 1° . , , 
where the sum is taken over all j-indices for which the inequalit) 


j 
i. oe . . daar. 5 1 ° 7 ;* 
fails to hold, and ») is taken over all z-indices for which the inequality (iv) 
i 


fails to hold. It can also be written 


= 
¢’ ca y v,!* ab \ u,!” 
=) 


Ly ; 
j C 
where 
= / / 
( 0, if ) 5 5X; Zee 
’ i 
i. 
j 
iy , -- \ ' , 
| v— L, Uses if ) Ay jX; = Os 
i i 
and 


0, if ayy; <0! 
3] 


> ayy, =; it ) ary; 2 vv. 


J j 


_Starting with the triplet (x’, y’, v’), von Neumann defines the triplet 
(&’, ¥’, 6’),"where 


t 
~7 u; 
yy = : 
2 th’ 
k 
, 
~ / wr v5 
5 ae ’ 
, 
k 
rid = ) a;,%,;'9,;'. 


4,9 


Observe that the constant relating u;’ to %,’ is so chosen that ¥’ = (%1’ 
be fe )isa Brehabiity. vector. Note also that the size of u,’ deisends 
oe Sa player 1’s ith strategy is against y’ relative to an “‘aspira- 
wees s 2 Bs so to speak, ‘The von Neumann recursive procedure then 

» Y , wv’) as a suitable weighted average of (x’, y’, v’) and 
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> (x", ys gl?) — (x®, yo’) 
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nd if we let o\” be the index associated with the Ath iterate, it — 
2 . 
shown by elementary means that: 


(i) gi) > o°” Z o'* ss 


and 


m n - 5 
(ii) o” << a [max aiz a a;;] 4 
h z, 7} uJ 


Hence, $“) — 0 as h increases. 
Von Neumann [1954] indicates how the above technique can be pro 


gramed for a machine, and he calculates the operation time of each iterate 
We quote: 


In evaluating the method it ought to be compared with G. Dantzig’s “simpls 
method.” In the latter method the a priori guarantees for the length of we 
tion and size of numbers are considerably less favorable than ours, but the ane 
practical experience with the “simplex method” indicates that its actual perfor 
ance—at least under the conditions under which it has been so far tested—s ne 
better than the limits that can be guaranteed. Limited comparisons 


| us method s fastet 
eel plex method” again indicate that the latter convers® faste 
but there is reason 


, trick 

hich to believe that our method can be accelerated by al 
7 émount to smoothing the iterative recursive sequence which 18 ; baa 
an making this recursion dependent on several predecessors. The descrip" 


j 2 A . illustt2" 
? pete as a first step in this direction, i.e., in order t© 
the new method, and 


ferred * 
ft to furnish a basis for the possible improvements © 
above. (Von Neumann [1954], p. 115.) ‘ 
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Games WITH 
INFINITE PURE STRATEGY SETS 


A7.1 INTRODUCTION 


An extension of the minimax (equilibrium) theorem to special classes of 
games involving an infinite number of pure strategies was first accom- 
Plished by Ville [1938]. However, particular examples of partitioning- 
like games (deployment of military forces) with infinitely many pure 
Strategies were treated by Borel in 1921 long before Ville’s systematic 
treatment of a class of infinite games. In the last decade, a great deal of 
“esearch has been concentrated on infinite games, the primary motivation 

ing games of timing (e.g., when to fire in an air duel) and games of 
Partitioning (e.g., what proportion of one’s resources to allot to a given 
€ndeavor), In many of the examples, it is suitable to identify the pure 
Strategies with real numbers in the unit interval and pairs of strategies with 
Points in the unit Square. A sizeable literature now exists for such games 
ver the unit square. 

Independently of Ville, Wald published a series of papers [1945 a, 
1945 6, 1950 a] on statistical decision theory in which he developed an 
extensive theory for two-person games having infinitely many pure 
Strategies. Much that Wald did in statistical decision theory using the 
§ame theory he so ably developed can now be accomplished, perhaps more 
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elegantly, without it; but in 
have been achieved so rea 
framework been lacking. 


A7.2 GAMES WITH NO VALUE 


Provided each player’s set of pure strateg! every nem 
(strictly competitive) two-person game has 


i. A value z. 
ii. An optimal (maximin) strategy for player 1 
iii. An optimal (minimax) strategy for player 2. 


To see that this need not be so when strategy sets are infinite, con; 
the following game: Players 1 and 2 each choose a positive integer, s 
and @ respectively. Player 1 receives one unit from player 2 if «> 
zero units if a = @, and he gives one unit to player 2 ifa < 8. Thew! 
idea of the game is to pick a “large” number. This game, as we willsh 
has no value and the players do not have optimal strategies. The pu 
strategy set for each player is the set of positive integers {1, 2,3,» * ‘|, 
a typical randomized strategy for player 1 is a sequence {%1, 4% ' 


Peel 7 
Where > x;=1 and x; 20. This strategy simply requires playe! 
t=1 3? } 
choose the integer i with probability x;,7 = 1,2, --:+. Wenows 


that, for any Strategy x, player 1’s security level, v(x), is —1: SI 


the sum 
i= 


lar . 2 $ * "lee eft 
~ gd re can always render the chance of 1’s winning arbitrarily 
by 


8 choice j j i na 
desires eng 1s Known, 2 can hold him down to as near (0 | 
' us, every x = { 0 
Similarly, for : sity 1 RU vgs tore coca 
9 nyo if a ° 5 om 
near to 1 as h 4 ._peayer 2's strategies, player 1 can ensure a" 
€ desires. Consequently, 


*; can be made arbitrarily small by choosing N suffice 


U1 = epi (x) 
, mee v(x) = —1 <v. = min ve(y) = +15 


; y 
ence the game has no value 


Furth 
€rmore . 
are selected igs Player 1 the Strategy in which the integers 1. ” ii 
: Probability zero, and the integers k + i, with pm ‘os! 
better than f x 4 =" 10,0, --- Es }, 18 ae # 
| tter ‘ ') In the sense that the former is at least a tb 
. or any 7 say 
latter jg inadmissib any y and strictly better for some: no? 
aus€ it is dominated by the former): 
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A7.2] 
For such a game, therefore, one can reasonal 
shall) that there is no “value” in ascribing a \ 
Although one might take the opposite view, arguing that the perte 
metry of the game should result in a (generalized) ‘“‘value”’ of ze 
not proved to be a useful definition. 

We still wish to establish a suitable abstract definition of value for som« 
games with infinite strategy sets. A game is, as before, a t iplet (A, B, M), 
where A and B are players 1 and 2’s sets of pure strategies, respectively, and 
Misareal-valued function defined for all pairs (a, 8), where a isin A ands 
inB. If(a, 8) ischosen, player 1 receives M(a, 8B) and 2 receives — M(a, 8). 
No assumption is made that A and B are finite. We further assume that 
from A[B] a set X[Y] of mixed or randomized strategies is generated’ hav- 
ing x[y] as a generic element. Corresponding to the pair (x, y), the 
expected return to 1 is denoted by M(x, y). 

For the game (A, B, M) with the mixed extension (X, Y, M), the fol- 
lowing terminology is employed: 


i. The game is said to have a value v if, for every positive e, however 
small, player 1 has a strategy in X which guarantees him a return of at 
least v — ¢ and player 2 has a strategy in Y which limits 1 to a return of at 
most v +. Such a game is said to be strictly determined. 

ii. A strategy from X is said to be maximin if it guarantees player 1 an 
amount v, and if no other strategy will guarantee him more. 

iii. A strategy from Y is said to be minimax if it guarantees that player 1 
receives no more than v2 and if no other strategy in Y will hold 1 down to 
less, 

There are, by now, classic examples of games, both with and without a 
value, where neither, one, or both players have optimal strategies. If 
players 1 and 2 have maximin and minimax strategies, respectively, then 
v1 < v9; the equality holds if and only if the game has a value. 


1In the infinite case the set of all distributions need not be a meaningful concept, 


and so some further specification is needed. A number of levels of generality are possi- 
ble, including: 


i. x uses at most a finite number of pure strategies (i.e., for each x in X there is 
a finite subset wz of A such that the probability, according to x, of choosing an @ in 
wy is 1). 

ii. X uses at most a denumerable subset of pure strategies. 

iii. All measures x are densities (with respect to some specified dominating measure 
Over a suitably chosen field or a-algebra of sets). 

iv. x is a completely additive measure over some o-algebra. 

Vv. x is a finitely additive measure with all sets measurable. [With this convention, 
the value of the game “to select the larger integer” (discussed on p. 448) is zero.] 


a Naturally, whether or not the game has a value or player 1 has a maximin strategy 
pends upon the definition of X we choose. 
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A7.4 GAMES WHERE A IS “ALMOST 


Of considerable significance are those games where player 1 can restrict 
his randomization to a fixed finite set of pure strategies with the knowledge 
that he will sacrifice less than €, say, regardless of what player 2 does. To 
be specific, suppose € is given (e.g., it might be « = 10-°). Then, we 
assume there is a finite set A, = {ai,, Qin °° * 5 a;,|, where r depends 
upon e, such that for each x of X there is a “matching” randomization x* 


over A,, with the property that 


|M(x, y) — M(x*, y)| <« for all y in Y. 


We can think of A, as a finite e-approximation to A. 

Now, since A, is finite, the mixed extension of the game (A,, B, M) hasa 
value and player 1 has an optimal strategy. By letting « approach 0, it can 
be shown that the mixed extension of (A, B, M) also has a value. How- 
ever, player 1 need not have a maximin strategy—its existence depends, 
in part, upon how general we choose to make X. 

Speaking very roughly, if A is finite, then a value and a maximin 
strategy both exist; if it is not finite but can be approximated by a finite set 
in such a manner that the sacrifice is arbitrarily small, then a value again 
exists, but a maximin strategy need not. 


A7.5 GAMES OVER THE UNIT SQUARE 


The infinite games which have received most attention are those whose 
pure strategy sets are the real numbers in the closed interval from 0 to 1, 
Le. 


A = {a|0 
B= {8|0 


If M is a continuous function on the unit square {(a, B)|0L a <1, 
0<BxK 1}, then, for every € > 0, a finite «-approximation to A can be 
found by choosing sufficient points scattered uniformly over the unit inter- 
val. Thus, all such games have a value. Players 1 and 2 have optimal 
Strategies provided X and Y are taken to be the sets of all cumulative dis- 
tribution functions over the unit interval [case (iv) in footnote 1]. 

More refined problems are: (i) Is it possible to characterize the optimal 
Strategies for games with a continuous payoff over the unit square? (ii) 
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moments of a cumulative distribution function F over the unit inte 
changing 1 to 2, m to n, and F to G, an analogous statement holds for 


Because of its increased versatility, 


where 7; and s; are continuous functions, seems to be a more promising 
approximation to an arbitrary continuous payoff. Games with payoffs of 
this type are called polynomial-like. For given m and n they can usually 
approximate an arbitrary continuous game more closely than can the 
polynomial games, but this advantage is offset by the fact that at present 
much sharper results are known for polynomial games. Dresher, Karlin, 
and Shapley [1950] show that in every polynomial-like game both players 
have optimal mixed strategies which use at most min (m, n) pure strategies. 
Finally, Blackwell and Girshick [1954, p. 54] have given an example of a 
continuous game over the unit square in which every optimal strategy uses 
all of the pure strategies. If X and Y are restricted to randomizations over 


at most a denumerable number of pure strategies, then their example has 
a value, but no optimal strategies. 


A7.6 GAMES INVOLVING TIMING OR PARTITIONING 


Games involving either timing or partitioning often can be reduced to 
§ames over the unit square, but in contrast to those we have examined 
Previously the payoff function is not continuous. To illustrate how the 
discontinuities arise, consider the game in which the two players—duelists 
are separated by a distance of two units. At a signal, they begin to 
approach one another at the same constant uniform rate. Each is at 
liberty to fire a single shot at any time he desires. A pure strategy for 
Player 1 [player 2] is a number a, 0 < a < 1 [8,0 < B< 1]. This is 
interpreted to mean that 1 [2] fires when he has traveled a [8] units, unless 
2 [1] has fired earlier and missed, in which case 1 [2] holds his fire until 
they are together. It is postulated that each player knows his own and 

'S antagonist’s probability of a kill as a function of the distance between 
them—these probability functions are assumed to increase monotonically 
and continuously as the distance between the players decreases. If 
Pi(y) [Po(y)] denotes the probability that 1 [2] hits 2 [1] if he fires when 


“y are a distance y apart, then a possible payoff M is: 


a M(a, 8)’ = (1)P\(2e) + (—1)[1 = Pi(2e)], fa <8, 


(c) 


= (1)[1 — P2(26)] + (—1)P2(28), ifa>s, 
= (1)Pi(2a) + (—1)Pe(2a), ifa = B. 


So 


aa << 


j Strategy Sets 
, Infinite Pure 
mes ith w 


[Ay 

Ga > probability al 

. in case a 1 shoots - : oy v2 ey that he hits 

For example, the probability of his being 2 a ue tact P,(2a), Sing. 

P, (2a), ane “4 hold his fire until he cannot m Ne have no With, 
: 2 wl 

{1 misses, 


nction; it 
he realism of this payoff fu is not continuous. Thee: 
defend the atural payoff in a duel is not uc : ere is ap 
tan . i < 6 1 shoots first and if ». 
trate that i along a = 8, since if a ate, nd if ay, 
of od The payoffs on the two sides of the line can dif. 
2 shoots frst. 
appreciably. ith a line of discontinuity in the payoff function have, 
Such games a ers have optimal strategies” (Karlin [1950], p. 144 
value and the p i ralized this example to what he calls a symm, 
i an [1953] gene ; ; > is: 
nis: a game over the unit square where M is: 
game 0 - 


erely Presented to} | 


j. Strictly increasing in a, for a < B and : “ia payoff to | 
improves the longer he waits—so long as he a : * - aa 
ii. Strictly decreasing in 8, for 8 < a and a ‘ : ; 
diminishes the longer 2 waits—so long as 2 es “a - +e aia 
in. M(a, B) = —M(Q, a) (i.e., the player’s capabilitie 


-— imal strategy 8! 

He shows that, aside from some trivial cases, the ae. eal 

randomization of a density type. This density ied ache 
as the solution to a certain integral equation which, 


inary linear differen 
be shown to be equivalent to a system of ordinary lin 
equations. ‘ql games? 
‘ rmmetricat 5° 
Karlin [1953] indicated that the salient feature of a ne i 
timing which leads to an integral equation for the optima , 


ae the ordet" 
diagonal discontinuity in the payoff function arising from 
which the players act, 


The history of 
a paper publis 
Borel [1953]), 


| who, v 
to Borel, 
Sames of partitioning dates back at least 


~ Fecomomelt! 
hed in 1921 (a translation appears 10 i 
Posed the following problem: 


2 : : 

Some interesting Variants of the simple duel are: 
A Each player is 
Pistols have silence 
ii. Only one pla ‘é 
ae yer has 
ii, BE ie ie, a silencer, 


; m bullets and 2 has n bullets. 
WV. ™mbinations of (iii) and (i) or (ii), f 


unaware whe 


ual 
6 pit. 
hen he! 
n his opponent has fired except W 
rs.) 


In cases (j 


“nc ui! 
mye blem, 10 
Probabilities ag the players know all the data of the pro” 


i) that 
; in case (ill) © 
Now the initial really Complicated, we can suppose In C Gis 


el an ye! 
isc ae ot know the initial m. Blackw play al 
USS (iii) in detail under the assumption that the 
ro mand n are known. 


— 


A7 6] Games Involving Timing or Partitioning 455 


. . two players A and B each choose three positi 


is equal to 1, viz: 


and each player arranges the numbers he has choser a determined order. A 
wins if two of the numbers chosen by him are superior to the corresponding num- 


bers of B. 


For example, we can think of two opposing generals with equal forces 
each of whom must partition his own forces among three battle areas w ith- 
out knowing how the other will deploy his. Each aims to have a numer- 
ical superiority at two of the three sites. 

Tukey [1949] and Blackett [1954] studied a more general class of games 
of partitioning called “‘Blotto games.” Tukey analyzed Blotto games of a 
symmetric type, and Blackett examined a specific asymmetric form. 
Without actually defining these games, their nature and possible applica- 
tions are suggested by the quotation from Blackett: 


The particular problem of Colonel Blotto illustrates a general class of ““Blotto”’ 
games in which: 

Two players (A and B) contending on N independent battlefields (labeled 1, 
2, ++ , N) must distribute their forces (F and G units, respectively) to the battle- 
fields before knowing the opposing deployment. The payoff (a numerical meas- 
ure of the gain of A or equivalently of the loss of B) on the 7th battlefield is given by 
a function P,(x, y) depending only on the battlefield and the opposing forces x and 
y committed to that battlefield by A and B. The payoff of the game as a whole is 
the sum of the payoffs on the individual battlefields. 

An interesting mathematical problem connected with these games is the deter- 
mination of their solutions from the payoff functions of the individual battlefields. 
Instead of studying this in general, the present paper illustrates the possible 
applicability of Blotto games to problems of logistics and tactics by analyzing a 
particular problem which may be considered as an especially simple Blotto game. 

Suppose a supply system is to deliver a shipment of material from a rear area to 
an advanced area by one of N independent routes subject to interdiction by enemy 
assailants. (By “independent routes” is meant routes such that a single assailant 
Cannot interdict more than one.) If the route must be selected in ignorance of the 
interdiction plans of the enemy and the enemy must station his assailants without 

nhowing the route the shipment will travel, the situation described may be 
regarded as a Blotto game in which A selects the route (battlefield) for the ship- 

ment (4’s forces) while B distributes his assailants (B’s forces) among the different 
Possible routes (battlefields). In this game, P,(1, y) represents the gain of A (the 
loss of B) on the ith route when the shipment travels the ith route and y of Bs G 
assailants interdict this route. ‘The analogous quantity when the shipment travels 
Some route other than the ith one is P;(0, y). 

N one military interpretation of this situation the shipment is a naval convoy 
and the assailants are submarines. In another the shipment is a truck convoy and 
the assailants are attack aircraft. In a third the shipment is a bombing strike 
against an enemy target and the assailants are interceptors. [1954, p. 55.] 
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A7.7 A MODEL OF POKER DUE T 

Another class of games on the square is desc the following 4 
tation from Kuhn’s extremely fine Lectures 0 ory of Camas i 
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p. 139]: 

A fertile source of examples of infinite games 1s prov! by models of eas 

d games. Indeed, they invite treatment by a continuous variable on .. 
oo First, the combinatorial complexity of finite models precludes nm 
Stpeiion of any but the simplest cases. Second, the natural linear ordering ¢, 


large number of hands in games such as poker virtually invites the Passage to 4 
continuum of hands. Consider, for example, the following model of poker dye 
Borel: 

“77 relance. An ante of a units is required by each of the two players. Att 
beginning of a play they receive fixed hands, s and ¢, chosen at random from th 
unit intervals OSs 51 and0S#21. Then 1 either bets an amount} -, 
or drops out, losing his ante. If 1 bets, then 2 can either see the bet or drop ou, 
losing his ante. If 2 sees a bet, the hands are compared with the higher card 
winning the total wager 5.” 

We shall assume that 1 uses pure strategies of the following form: he chooses: 
number a@ in the unit interval, 0 S a S 1, and decides to bet when his hand 
exceeds @ and to drop out otherwise. Correspondingly, a pure strategy for 2 will 
consist of the choice of a number 6 in the unit interval, 0 S 6 S 1, and the dec 
ce to see any possible bet when his hand exceeds 8 and to drop out others 

€ careful reader will want to verify that every other pure strategy is dominat 


a One of these. He will also remark that the original game had too many 
Strategies to be a game on the square.) 
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SEQUENTIAL COMPOUNDING 
OF TWO-PERSON GAMES 


A8.1 INTRODUCTION 


We first encountered sequentially compounded (i.e., temporally repeat- 
ed) two-person games in Chapter 5, where it will be recalled compounded 
zero-sum games seemed to be reasonably well behaved but compounded 
non-zero-sum games exhibited certain anomalies (e.g., the prisoner’s 
dilemma). Here we shall explore the structure of compounded games 
more fully, using more complex compounding rules than in Chapter 5, 
but confining ourselves primarily to component games which lead to a 


< zero-sum overall game. 
The relevant literature which dates back, even if we are generous; only 
; to 1949, is already extensive. Apparently, the time was ripe in the 
farly *50’s, for independently a number of workers attacked variations on 
a the theme of compounding. Unfortunately, this has led to a good deal of 
redundancy, both conceptual and technical, in the literature. 
a; The central ideas are these: In one class of games (recursive and 


Stochastic) a normalized game is played at each stage, and the player’s 

Strategies control not only the (monetary) payoff but also the transition 

Probabilities which govern the game to be played at the next stage. In 
. @nother class (survival and attrition games) there is but one component 
457 


eo 


Se 


we ——- 
———<——— 


2 selects By), th 
Player 4 receiy 
Comes aeR to 
abilities 0.4 0 Play P? ae 
Outcome r re and 0.1 


8 


Sequential Compounding of Two-Person Games 
458 


d it is repeated. The players pe tn ed babi 

ao te in time according to the ou t | 

Be ce, The overall game is conclude: | neat 

oe Sih still another class (compound 

er fedand each player attermpts ' 

os . ~ the statistical records of his ac 
by explol 


10N problem 


Tesource 


3) ao 
4 gy 
rol the aVerage Da 
Aary'S Previous cho 
The final class (economic ruin games), which on 

8 id policy: 7 : 
ified by the problem of corporate dividend policy: The more pene 
typl 


: 
{AQ 
! 


repeated play Be 


1€ Dave, 
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‘wencies; however, in opposition to this platitude is the truth, impoy 
Fe crencat rates, that a dollar today is worth more than the present vq 
> 


of a dollar to be delivered in the future. 


As we explore these topics we will also indicate some of the interrel; 


tions: how the theory of stochastic games suggested that of recursive ga 
which, in turn, is related to the theory of survival and attrition games; ha 
Blackwell’s approachability theory, which was motivated by attri 
games, can be used to analyze compound decision problems; and ha 


approachability theory is technically similar to a generalization 0! 


theory of survival games. 


A8.2 STOCHASTIC GAMES 


. a5 2 whe 
The first game to be described is a specific stochastic gam 
involves these two payoff matrices: 


Component Game I"! 
; Bi' Bo" ae 
al 4 & (0.48, O5r', Oat) O& (0.28, 0.50 0.9 A 
PP e065, Or} o.4r’) 2 & (0.88, 0.20, 


Component Game I? 
2 


D) By 
RE as, OT’, 


Ba" 1 0.21") 
Ore 2 & (0.65, 0.20. 5 | 
2& (0.18, 0.571 


I’) 
0.472) 5 & (0.35, 0.61, O! 


yers Making 


moir) wg gu 
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c 4 9 
the component game eis Suppose 1 chooses a* and 2 chooses 8)*; then 
the payoff is 2 & (0.15, 0.5", 0.41"). As before, this means that player 2 
gives 2 units to player 1 and a lottery is conducted Niecdvalan slicucaas 
“Stop,” “Play e next,” and “Play I’? next” occur with probabilities 0.1 
0.5, and 0.4, respectively. The play continues until a lottery yields a 
“Stop outcome, and the overall payoff to each player is the sum of his 
payoffs in the component games of the play. 


The play proceeds from component game to component game accord- 
ing to transition probabilities controlled jointly by the players. At each 
trial, each player must consider not only the probable effect his choice has 
on his positional payoff at that trial but also its effect on his chances of 
playing the several component games in the future. Since each lottery 
has a positive “‘stop’’ probability, the play is “‘almost certain” (i.e., with 
probability 1) to terminate in a finite number of steps. 

The generalization is straightforward. The r component games I’, 
i. oe. are given. 
k 


In game I“, player 1 has the pure strategies 
po 
Q1, QQ 


> °° * 5 Qm,* and 2 has the pure strategies B;", Bo", - - - , Ba” 
If 1 uses a;* and 2 uses B;*, the payoff is 


1 k R ( k0¢ kipl k2p2 cages k r 
tj G tj ? pi; > Pij > > pi; pe 
where 


pi;*° > 0, pa? > 0, ford =1,2,--° + ,7, 
and 


pi;*° + pi;* = ae + pi; = 4. 


These payoffs are interpreted to mean that player 2 gives 1 a; ;* units 
and a lottery is performed in which the play terminates with the posttive 
Probability pi;*° and the component game T° is played next with prob- 
ability p;*!, P=1,2,---,r. 

If we let T stand for the collection {r, r?, - + + , I}, then the specific 
Same which begins with I* may be denoted by the pair (T; MY). 

Shapley [1953 d] first defined stochastic games, and he characterized 
their solutions in this sense: for each initial condition, he gave a method 
for finding the value of the game, 1’s maximin strategy, and 2’s minimax 
Strategy. We illustrate his procedure for our specific example. 

First, the given game is truncated at trial n as follows. It is played 
Without any modification as long as it terminates prior to the nth trial. 
But, should it last n trials, then, instead of playing a component game at 
tial n + 1, player 2 gives 1 a fixed amount w{”, if I’! was to be played, 
and wif 2 was to be played. Elliptically, the game is said to be 
A cated at trial n by means of the payoffs (w{”, wi)? If n is large, 

Mtuitively clear that the truncated game is not very different from the 
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e ticular value: ) i 
nal game, and the pat Mo ‘ Which are 
orig) t critically affect the overall va runcated « 
should , t is important since the truncatec te <a 
Bee y. At trial n (if the game lasts t! © payofls an 
Game I! (w{”” w: 
‘i Me | 
he 4+ 0.520 {” + 0.1w>° ww ; ea 
Game T'?(w{, w3””) 
Bi? Bs” 
a” a. 2+ 0.2w}” + aa 
ax” : + 0.50}” + 0.403” 5 + 0.6w{? + 0.10)” 


Note the labeling: the payoff matrix of component game T* at trial 
denoted F*(w{”, ws”), & = 1, 2. 


Let 
1 0) (0 
w°)) = value of zero-sum game By eo”, w)) = val T (wi, w 
2/,,,(0). {0 
w') = value of zero-sum game I'?(w{, wh”) = val P*(wy ; 4} 


We now work backwards. At trial n — 1, the outcome “Play I” nest 
has the value of wy” to player 1,4 = 1,2. Thusat that trial, the plas 
should behave as if they are playing I'!(w"’, ws") and Pwr # 
Continuing our backward induction, for any integer 5, O<s<a- M 


(st1) _ 
wet = val T*(w??, a), for k = 1, a 
In particular, at the first trial we have 


wi) = val Tr n—1 n—1 for k = qu 
k ( : 4 : ry. 
> ant 


) p) 
(; 1?) and w§” are the values of the gam ? 
i 0) 6”). 

oe olan when they are truncated at n by (wj ’ » alll iy 

Ht erations , be Wy 

value transformation: Suggest that we define what may into 


Pair [val T4(y,_ the function T which maps the pair (#» "* «7.0 


2 S 
above induction = a D*(w1, we)] = Tiwi, v2). 1 aie 
€ written: 


Thus we ( 
see that w{" 


T(w” 
Go, wh) = Ww, wy) 


T?(w(0) 0 (2) 
(w”, ws ) = T(w, wi) = (w, ws”) 


T"(a,)(0) 
Ww, w) = eof” ,.20§™). 


a 
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It is intuitively plausible (and 
4. As n increases, 7"(w}’, u 
(wi, wi). We denote it by 


7 


lim 7"(w}" 


n— 0 
Bi (w.*, W2*) is the unique pair 1 
T(w.*, Wo) = (w,* 


ney (w*, we*) is the unique solution of the two equations in two un- 
knowns: 


I 


w, = val T}(w, we) 


| 


9 
we = val ['“(w, we). 


In the general case, (w1*, wo*, 


- , w,*) is the unique solution of the 
system 


wy, = T*(wi, wo, °° * , er); for k = 1, 2, “S 


3. Player 1 has a maximin strategy for the stochastic game which con- 
sists of playing a maximin strategy for I*(w1*, w2*) whenever the com- 
ponent game I“ arises. This guarantees player 1 an expected return of at 
least w;,* for the game (T; Tr’). Changing 1 to 2, maximin to minimax, 
and w;,* to —w,*, an analogous statement holds for player 2. 

Shapley points out that, if player 2 has only one pure strategy in each 
component game, he is really a dummy player and this degenerate 
stochastic game amounts to a dynamic programing problem for player 1. 


A8.3 RECURSIVE GAMES 


Recursive games and stochastic games are closely related, the only 
difference being the form of the payoff functions in the component games. 
Recall that, when 1 chooses his ith strategy and 2 his jth strategy in the 
Component game I“ of a stochastic game, then the payoff is 


Bp ke PS pT pT’), 
where p; ;*° = 0, pij*” 2 0, for! = 1, 2, erst ais and p;;*° y bis oe 


1=1 
Ina recursive game two small, but important, modifications are made: 


1A Payoff of actual units (such as a;;") only occurs when the play 
| terminates, 


2. The probability of stopping, pk is not necessarily positive. 
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eque ‘a 
‘anf payoff in a re 
ey? eat 4, F272 Me 2. 
pe? Cass" and S); pij 2 i) ) Dp : I”), 
1j 
= 2 . af , r a Nf z C ‘) el = le a 
where pei 2 0, for ie = 0, it 5) 9 "9 1. This 
a1] k0 nec ] = ce 
interpretation: With probability /:; (not ne y POStiIeR) gas 
in er ; k 7 7] Dro! ty -. kl ¥ Dla 
stops and 2 pays 1 a;;* units, and a pro Pi; the pane, 
is played next (and no units are exchanged a trial!), for) 
a r, The basic reference for recursive games is Everett [1954] U 
cy 1Ag fs *hastic and ra a ' 
The conceptual differences between stochast ic an recursive game ¢ 
illustrated in the following simple examples discussed by Everett. 
Example 1. 


By Be! 
oa, (vr 1]. 


There is only one component game, and player 1 is strategically ; 
dummy. If B,! is always played, then the component game is repeate 
indefinitely, and so the recursive game never terminates. This te 
possibility must be taken into account by the rules of recursive games 
We shall say that the payoff to each player is zero for any non-terminatix 
play of a recursive game. 

Example 2. 


B,* B.! 
Tr. ay) _ 1 
a | 0 |. 
, By using the mixed strategy [(1 — ¢)a,1, exo) repeatedly, whert 
Se <1, Player 1 can force the play to terminate with probability ont 


ieldj i : nY, 
yielding him an expected return of at least 1 — e, regardless of 2 g strates} 


For, if 1 aig 
oe be eg ony Stage, then the play terminates with probabit 
surely ne fe) return of one unit; and, if 2 uses 62’, then se 
fore, a, aks yielding 1 an expected return of 1 — € a pe 
an A ; oT ie 

suarantee himself an expected return arbitra)” 


; but h. 
’ € Cannot re : mas 
Strategies, ach that value, and so he is said to have ¢ 


| 
or this re 4 tio” 
Cursiv . func 
equati € game, the analogue of the importa™ 


ion Which : 
ar : “v 
ph the analysis of stochastic games 1s: 


the us 
= 1. However, althous 
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min strategy for the game 


maxi 


is a1, its repeated choice leads 
2 can prevent the game from ter: 


Example 3. 


I 
as I ZU | 
i 2 | 
a3 20 I al 


This third example is simple to analyze, since the payoff T’” in com- 
ponent game I’ is trivially worth —10 to player 1. Player 1’s maximin 
strategy is always to use (0a;", 14a2', 143!) in the game T'}, player 2’s 
minimax strategy is (148, 1489) for component game I}, and the value is 
+5. By denoting the payoff for (a3', 62") and (aq, B11) as P2, not —10, 


several points about recursive games are easily demonstrated: 


i. In general, the solution of a recursive game cannot be obtained as the 
limit of solutions of truncated games. For suppose that this recursive 
game begins with T'! and that, when the play does not terminate by the 
nth trial, the payoff is zero to both players. This truncated game has 
value +10 to player 1, since he can play a; for the first n — 1 trials and 
(Oa?, Way), a3!) at trial n. Regardless of player 2’s strategy at trial n, 
I's payoff is the lottery (14 20, 14T?), which is worth 10 units to him since 
the outcome I’? on trial n is worth only zero when the game is truncated at 
n. But, as we noted, the value of the recursive game is 5, not 10. 

ii, A fixed point of the “value transformation” does not necessarily yield 
the value of the game. Also, there is not necessarily a unique fixed point, 
and, therefore, we cannot conclude (as for stochastic games) that, starting 
with any initial point, repetitions of the value transformation will neces- 
sarily yield a sequence converging to the value of the game. As an illus- 
tration, observe that the system 


Ww, Wy, 
w, = val|we 20) we = val i era TA 
20 we 


has (w1, —10) as a solution for every w; 2 5. Furthermore, if we start 
ug (w{?, w”) = (0, 0), then we get T(w{”, wh) = (10, —10) and 
Fe”, 5”) = T(10, —10) = (10, —10), etc. Thus T*(0, 0) con- 
verges to (10, —10) instead of the value Gr= 10): 


464 Sequential Compounding of Two-Person G 

These examples establish that, mathematic | ere must [Ag 
siderable divergence between the theory of stor ind recurs aie 
Nevertheless, Everett [1954] shows that every , ae Bam 
and that value is 4 (not the!) fixed point of th: vita be a val 
players do not necessarily have maximin an issn Pee iy 
Everett characterizes what may be called thei timal dona 
results, although more complicated than tho " abasi a H 
be outlined as follows: enn 
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. 
(w1, iy, ara ts) — (w’, we’, el gn): 
where 
wy’ = val T*(w1, wo, °° * 5 @r); | a i ee 
has the properties! 
(i) wie > Why whenever w;, > 0, 
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for k = iy 
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for player 2 is: the r-tuple a ahs ha least w;. The analog 
1, We, * * * , w,) is obtainable by 2 if 


(i) > 
Gi Wr < Wk Ww henever Wk << 0 
ii) W, < w Ww V 
’ 


for k = 1 2 
A DE aia) ge ot? - 
begins with re fied by If (wi, - - - , w,) is obtainable and the 
b » We 


layi  % “4a 
Whenever the com Playing a minimax strategy of pew, 0 
can Suarantee that {ee game T* occurs. = 1.2,°°°> P playet’ 
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Definition. The Player 1 gets an expected return of at most ig 
vector if, f r-tuple (w{”, {0 Wile oi anil be 4 ott 
) or each e > 0) ho fae, + wi! ) is said to ' 
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if wi< li) j 
1, In exam 5 
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A critical vector (if it exists) is unique, and it is inte: preted as the “‘valu 


of the recursive game. 
9. If the critical vector exists, then it is a fixed point of th Aue 


. . r voy 0) P 0) is 
tormation” T, i.e. Tw), wy (on). a! Ge 


3. Every recursive game possesses 


p Although formulating the definitio tan obi abie vectol is delic te, once it is 
done correctly proving the assertions in 1 is a straightforward analytical matter. 
Assertions 2 and 3 are somewhat deeper. First, consider a recursive game with just 


one component T! ie., I’ is played repeatedly until termination occurs (if it does). 
Let m and M be, respectively, the minimum and maximum payoff entries in I"'. 
We shall consider val ['4(w) as a function of w, which can be plotted as in Fig. 1. 


Fic. 1 


The following can be shown: val I'!(w) is monotonically non-decreasing in w, it is 
Continuous, val '4(m) > m, and val T'4(M) < M. These four properties imply 
that the graph of the function for m < w < M must cut the 45° line which passes 
through the origin, i.e., there are solutions to the equation w = val Ii(m): pda 
Fig. 1, w* is a critical vector since, for any € > 0, w* — ¢€ is obtainable by 1, ie., 
val T(w* — ¢) > w* — ¢, and w* is itself obtainable by 2 (ie., val M4(w*) < w* 
andw* >). So the recursive game has value w*, 2 has a minimax strategy, and 
1 has €-maximin strategies. (A set of inequalities, which we have not presented, 
rules out the possibility that the graph again crosses the 45° line as w increases. 

hus, certain potential pathological examples do not really plague us.) 

Next, we consider a recursive game having two components, T'! and I’, and now 
We let m and M be the minimum and maximum entries of the two payoff tables. 

We were arbitrarily to substitute the number w2 for T?, then the given two com- 
Ponent recursive game would collapse into the one component recursive game 
As we know, this game has a critical value which, of course, depends upon 


2 To show this dependence, we shall denote the critical value by w1*(w). 
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that line at w ? 2fw1*(we), we) versu - 0 above, 
‘ncreases. A plot of val P*[w1 (wz), w2) versu nular in form tp 
fe wor be the point closest to the origin where t! Crosses the 450 a: ty 
point exists because of the properties we have ted). Our da : 4 
[wi*(we*)s w*] is a critical vector of the recurs e i 
. s at Hc ro L E ° 
To prove this assertion, one maust show that | * ‘$s have (different) ve 
which are obtainable and which lie arbitrarily close to [w1*(w»'), ns 
illustration of how this can be done, we will take the case where w1*(w,4) er 
w2* > 0, and we will only worry about a vector obtainable by 1, Choose y.! 
*, 1 tags fos, FY ¢ 4) IK 
low and “‘very close”? to w2*; then by continuity w;*(w2") is very close to w,*(y,t 


and, since w2* is the point nearest the origin where the curve cuts the 45°} 
val T'2[w1*(we’), we] > wy’. 


Now, #1*(w9’) is critical for the reduced game I"! when wy’ is substituted for)! 
therefore, there exists a wy’ arbitrarily close to w,*(w2’) which is obtainable in} 
reduced game, namely: 

val T'(w 1’, we’) > wy’. 


Finally, we note that wy’ is “close” to w1*(w2*), and, since wy’ is also clos 
w1*(we'), continuity implies 

val T?(wy’, we’) > we’. 
Therefore, (w,’, wo’) is obtainable by player 1. 


cs Everett Gives a rigorous inductive proof that a critical vector exists for yy 
Cursive game which follows along the above lines. 


ee 154) actually proves results more general than fr 7 
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for the same strategy pair. Since p,,*°> 0 (a 5, 


stochastic games), the quantity a,;"/p;,"° is well defined. We n 
in the stochastic game the payment of 


. vay 
s ( 


in the related recursive game, the expect 
units. Also, the transition probabi 


AELAGC 


outcomes “Stop”? and 
“Play 7 next’’ are identical in the two games. Conseque ntly, the stra tegic 
analyses of the stochastic game and its related recursive game are the same. 


In the theory of stochastic games, if the assumption that p;;*° is strictly 
positive for all 7, 7, and & is dropped, a value does not always exist. This 


can be illustrated by a simple example. In the game I!: 


By" 
ay" [1 <i ny, 


player 2 gives player 1 one unit at each stage and the game is repeated. 
So the “value” (if it can be said to exist) must be infinite. If the meaning 
of “value” is extended to include any finite number plus “‘numbers’’-+ 0 
and — 0, then it is easily shown that for one-component stochastic games 
an “extended value” always exists—even if pi;'° = 0 for some i, J; how- 
ever, this result is not generally true when there are two or more com- 
ponents. For example, in the game: 


ComponentI'? Component I? 
By" By” 
a f1+T7] a? [-14+Tr) 


Players 1 and 2 transfer one unit back and forth, and so a “‘value’’? does 
hot exist in the usual sense. However, if all the a;;* are non-negative (or 
non-positive), an extended value does exist. These games Everett calls 
univalent, and one-component games he calls simple. 


A8.4 GAMES OF SURVIVAL 


Games of survival are one of the possible generalizations of the classical 
Sambler’s ruin problem: Two gamblers initially have r and R — r dollars, 

and at each flip of a (not necessarily fair) coin the loser pays the winner a 

dollar. The game is terminated when one of the gamblers’ capital is 
exhausted—when he is ruined. Centering attention on the gambler with 
initial Capital r, let p be his (constant) probability of winning a dollar and 
tee |b his probability of losing a dollar when the coin is tossed. Let 
7cnote the probability that he is ultimately ruined. It can be shown 


' 
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Feller (1950, P- 283) 


(cf. 


) that Ad, 


ews), =, | 
(q/p)® — 1 ae 

ao ae, RS 
Cm 9 


es the distribution of the time duration of the bi 

Once the gamblers are committed to Playin g this ruin game, m me 
problem is ‘nvolved. But one can be introduced by the following Ks 
fication. As before, assume two gamblers, players 1 and 2, ente; a, 
game of survival with r and R — 17 dollars, respectively, but, instead r. 
chance device determining the payment at each trial, they play a six, 
zero-sum (in monetary units) game. If player 1 uses strategy a; (j=) 
2,- °° ,m) and 2 uses B; G = 1,2, -- - ,m) at any trial, then 1 rece 
a;; dollars from 2. The game is repeated until one of the players is ruin 
As an example (Hausner [1952a, b]) suppose: 

(a) Player 1 has dollars, where 7 = 1 or 2 or 3. 

(b) Player 2 has R — r dollars, where R = 4. 

(c) At each trial the players play the zero-sum game 


Bi Be 
Qa) 2 —1 
a@2|—2 ‘ies 


This ruin game is equivalent to the following three-component 
game: 


Feller also discuss 


recut 


Component Component Component 
Game [! Game I” Game I” 
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2 I e mort {| a,°|1 i" 
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The value transformation for 1 pli f 
yields 
Ch E 
WwW, = val | 
| 
a Sey 
A Ww} 
we’ = val | - 
“ ) 7 1 ' 
{ Of Wie del oe 1 
; a re 1 — wwe 
w3 = val = A hea 
Wy 1 ] 2? — 


It can be shown that this value transformation has only one fixed point 
(w*, w2*, w3*), where 


v3 v3 


w*=1- oe 0.293, wo = 0.5, w3* - 


Since w;* > 0, wo* > 0, w3* > 0, the general existence theory of recur- 
sive games implies that (w1*, w2*, w3*) is obtainable for player 2 and that 
a minimax strategy for player 2 is to play his minimax strategy in each of 
the component games I'*(w1*, wo*, w3*), k = 1, 2, 3. In this special 
game, the composite strategy in which player 1 uses his unique maximin 
strategies in the component games I*(w,*, wo*, w3*), k = 1, 2, 3, is 
maximin. 

In summary: if the two players have a total capital of four units and 
repeatedly play the zero-sum game 


By. Be 


QA) 2 —1 
(6m) —2 il 5 


then: 
Current Capital | Probability of 2’s Maximin Strategy | Minimax Strategy 
of Ruin When 1’s for 1 When 1’s for 2 When 1’s 
Player 1 Current Capital Isr Current Capital Is 7 | Current Capital Isr 
fal 0.293 (0.414a1, 0.586a2) | (0.41481, 0.58682) 
r=2 0.5 (0. 5a, 0.5a2) | (0.29381, 0.70782) 
r=3 0.707 (0.586a1, 0.414a@2) | (0.41461, 0.58682) 


Hausner’s treatment of this game, which differs from that given here, 
predates Everett’s work on recursive games and Shapley’s work on 
Stochastic games. 

As another example, also given by Hausner [19526], suppose the players 
each begin with one unit and they play a game of survival based on the 
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By “player 2’s survival game” let us refer to the game; 
ME In 


which 1 “wins” if and only if 2 is eventually ruined. There is no st 

7 Lf ° . . N) i] 

which makes 1 certain of winning in player 2’s survival game Tatty 
The induced recursive game for player 2’s survival game is 


Game T! 


Bi Be" 
ay} e. .1 
= On 


This one- ’ 

Fs Sree pene has already been studied (cf. p. 462) 

the value 1; and that . w,* = 1 is a critical value; that 2 can obtain 
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jii. A two-person zero-sum gam 
a;; if 1 uses a; and : uses B;, for i - 
The return a;; 15 an integer for all i, ; 
iv. This zero-sum game is played re 
ruined (i.e., his capital is reduced to zer 
v. Player 1 “wins” if and only if player 


NO 


“player 2’s survival game.”’) 

The above game induces a recursive 
games . k=1 
the payoff is: 


os 


eeetaceaR — 4. > If 1 uses a; and 2 uses 8 


(a) T&+ai;) if1 Sk+ ai; R-1 
(b) 0 ifk + a;; < 0, 
1 ifk+ ai; > R. 


From the general existence theory of recursive games we know: 1. 


This game has a unique critical vector—an (R — 1)-tuple (w, * 
Wr—1*)—which is the value of the game in this sense: If 1’s ini 


tial capital is 
rg units, then he can guarantee that 2 will be ruined with a pr 


obability that 
is arbitrarily close to W;,*, and 2 can guarantee that he will survive (not 
that 1 will be ruined !) w 


ith a probability that is at least equal to1 — w,,*. 
2. The critical vector is a fixed point of the value transformation, T; 
Which maps (w, we, * 5 Wr_-1) into (wy’, wo’, - + - , we_1’), where 


aS * 
»W2°, ’ 


wy’ = val I (wy, We, eae Wr—1), k= t 2, tetas fo ts 


Finding this fixed point of the value transformation entails solving R — 1 
€quations in R — 1 unknowns. 


3. From the interpretation of the problem, 

0 < wit < wo* € +++ < wei* <1; 
therefore, (wi*, wo*, - + + , we_1*) can be obtained by player 2, and his 
minimax strategy is to play a minimax strategy of I¥(w,*, wo*, - - - 
“r_1*) whenever the component game I™ occurs. 
4. Starting with the (R — 1)-tuple (0, 0, - - - , 0), successive iterations 
of the value mapping yield a componentwise monotonic non-decreasing 
Sequence of (R — 1)-tuples 


(1) (2) seen 
Be RO PY af, 6 el), 2 


(w{??, w8?), gages w°{?),), CF fe 


? 


> 
Which conver 


nua Y 
Slve game 


ge componentwise to the fixed point (w:*, wo*,-- + , 
The vector (w{?), w{?), - - - , w{?),) is the value of the recur- 
truncated at trial p by the vector (0, 0, - - : , 0) in the sense 


game with R — 1 component 
B;* in I*, then 


ti. 
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that, if 1’s initial capital IS 
| 
w\?? provide 

These resu 


theorem 


— 4 
results on univalent games which in turn 


results. 


rean Carn 
A Oh me 


units, his probab 
d that optimal strategies for the 3 
Its are similar to, but do not fc 

s about stochastic games; however, 


Of course, neither Shapley’s nor Eve 
Peisakoff. Peisakoff’s paper contains many 


yt rui s 
ining 2| 


NV tr) 


LCG Same are iT 
SEL 


mM, Shapley’, 
) follow from h; 


notivated by oy. 
esults were avai] 


- INENious trick, 


Jater, but arrived at independently, by Shapley and Everett. 
’ 


Milnor and Shapley [1955] have further g 


generalized the scope 


theory of games of survival by assuming that the payoffs a;; are not 
sarily integers. Since they need not be commensurate quantitig 
infinity of different distributions of capital can occur during a sip 


play of the game. 
Again, let the total capital of the players 


be R units, of which player 


has ro units. Neither R nor ro need be integers. At each stage they; 
the m by n zero-sum game (a;;], where the a;;’s are not necessarily integ 
It is assumed that a;; ~ Oforalliandj. If any row hasall positive entr 


its repeated use would automatically ruin 
column has all negative entries, player 2 
player 1. These cases are both trivial and 


player 2. Similarly, if 
would be certain of ruin 
special, so they are exci’ 


from the theory, i.e., we assume every row and every column has b 


positive and negative entries. The ultimate 


(i) 0 if player 1 is eventually ruined, 
im) 1 if player 2 is eventually ruined, 


payoff to player | is 


(iti) An amount P,, if the play is non-terminating. 


i hich 
‘ Milnor and Shapley show that such games have a value “a 
independent of the number P., and that both players have strat 


which are uniform] 


y optimal f : 
these conclusions is p eeall P. 


interested, fairly subtle, but we shall sketch it for thos 


> When 
the a;; : 
resou ij8 are in 


‘ ‘aad allocations is fini 

= rent of the value tr 

=, ‘hy ey than use the notation (w,*, - 

tl i) Present context, let us em 
€n the fixe 


g(r TF 431) 
¢(r) = val 


g(r na am 1) 


tegers and, therefore, when player dig 
te during any play, the value of a gam e 

ansformation. A related result holds fo" ' ould 
"= ,wr-1 )> 
: ploy the symbolism : 
functional equation d point of the value transforma 


ument les 
The arg - who? 


0 iW 
«ne gene 


” 


Ww iC W 2) 
1), 7, 
oes ” 


g(r + ain) f 


g(r + damn) 


A8.4] 
where 
yl = 4 
When r is restricted to integral values, the functional equation B, with 


1 y 
) DouNnaGAary 


conditions C, simplifies to R — 1 equations in R 
case, r is not restricted to integral values, but B and 

In a given play of the survival game, let 7; represent player 1’s capi 
of k trials. The sequence {ro, 13, 72, t 


the general 


i 
-} gives a trial-by-trial record of 1’s 


2 
financial holdings in the given play. If0 <r, < R for all k, then play does not 


terminate and 1’s payoff is P..; if0 <1, < Rfork =0,1,°°°,N—1andry < 
Qorry > R, then the play terminates at trial N, and we assume ry+p = rw for 
p=0,1,°--. If the players choose pure strategies for the survival game, 


then the sequence {7;} is uniquely determined; if they choose mixed strategies, 
then they jointly generate a probability measure over the set of {7,} sequences. 
In probabilists’ parlance, the set of sequences plus a probability measure over them 
is called a stochastic process, and a particular sequence is said to be a realization 
of this stochastic process. 

Suppose a given play results in a sequence {7,}. If player 1 is ultimately ruined 
(ry < 0) his payoff is zero regardless of the value ry, i.e., the payoff does not vary 
with the difference ry — 0. Similarly, if player 2 is ultimately ruined (rw 2 R), 
1’s payoff is 1 regardless of the value of ry, i.e., the payoff is independent of the 
difference ry — R. Neglecting these differences, which is conceptually trivial to 
do, leads to some mathematical complications. Let us see why. It is plausible 
that, for two different initial amounts of capital, player 1 can have the same 
expected payoffs. Mathematically, this means that the function g(r), which 
eventually will be identified as the value of the game to player 1 when his initial 
capital is r, will not be strictly increasing in the interval 0 €r< R. Without 
strict monotonicity, and therefore without a 1 to 1 relation between ¢(r) and r, we 
cannot make exact inferences about the sequence {r1,72, * * *} from an analysis of 
the mathematically more tractable sequence {g(r1), o(r2), °° ays 

We can eliminate the difficulty by modifying the payoffs of the survival game to 
take into account the excess by which the capital limits are exceeded, For exam- 
ple, the payoff given in A can be changed to: 


(i) ern, if ry < 0, 
Pd eG —R—M), . ifrn 22, (D) 
(iii) P,,, if0 < rz < R, for all k, 


where M = max |a;;| and ¢ is a small positive quantity. (Later, limits will be 
evaluated as Ecaraathes zero.) Similarly, the boundary condition C can be 
changed to 


(E) 


er, ifr < 
1 +ery —-R—™M); ifr 2 


Note that as¢ approaches zero, the payoffs in D approach those in A and the bound- 
ary conditions in E approach those in C. pay 
Now let us assume that we have a function ¢ which satisfies B and E and which is 
Strictly increasing inO <r < R. For the game with payoff D, let player 1 adopt 
the Strategy that, if he has a capital accumulation of rz,0 <7Tk < R, at trial k, then 


g(r) = | 


= 
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474 Sequen 
he plays 4 maximin strategy for the zero-sum game 
e 
[o(re <i ai;)|, t a i 5 Zs foe 5 It al , a 4 
; : umulation at trial & + 1 the ae | 
Player 1’s capital acc Bs, mixed. To $ Upon thang 


s fact ] 
either player’s strategy at t fact, We denote 


< ds clom 5 l 
pope ac tial +1 by fren whiere the “4 =. dj te 
eye — ajyj Ss U ICt O 
example, the probability that - is “ Be nes Q a ol the Probabi 
that 1 chooses @; and the eee ve wi "Ne value whi 
the function ¢ assumes at trial k + 1 depends, therefor pon chance, 
Suppose now we have a known past history (ro, 71, © °° » x) attrialk +1, 


as we assumed, 1 plays maximin in the game [y(re + a:;)] and 2 plays any fie 
strategy, then the random variable g(x41) must, by the meaning of maxini, 
have an expected value at least as large as val [g(x + ai;)]. But by the assump 
tion that ¢ satisfies eq. B, this must equal ¢(rz). Thus, if 1 follows this strategy a 
every k and 2 plays any fixed strategy, a measure is induced over the sequen 


{r,}, and therefore over {p(rx) }, with the property that 
Elo(iesi) | ore), «°°» Gro) 2 ore) F 


for each k and each partial sequence 70,71, °° * 5 7k. That is, for any past history 
ro 11) °° * 5 Tk [or equivalently, ¢(ro), o(71), °° * > (rx)] the random quanti) 
(#41) is well defined and its expected value, conditional upon the past, is at least 
g(rz). (To make this assertion mathematically precise, one must insert som 
“with probability 1” qualifiers.) A stochastic process which satisfies Fis said to be 
a semimartingale (Doob [1953]). Milnor and Shapley apply Doob’s existent 
theorem for semimartingales to show: 

1. The set of infinite sequences {y(rx)} for which the limit of (rx) ask" 
ng not exist has probability zero. Thus, intuitively, we can think of any play é 
the game as generating a sequence for which lim ¢(rx) exists. 

2. The limiting value must depend ae he sequence itsel! ¢ 
pende upon ch pend upon chance since the q aba 

P ance, so, for emphasis, we write lim ¢(*s)- The pt 


f dee 


distributi Pc... 6 
a e. of these limits is such that its expectation, conditional upon k 
0 18 at least y(ro), namely: 


now ing 


Bl lim elf) | e(ro)] 2 etr0)- 
From 

erie ete ne can conclude, as follows, that the probability of _ 
not converge is J ccording to 1 the probability that the sequence ss ipl” 
that the prokebilin and, since ¢ is assumed to be strictly monotonic, © es Bul 
Bate Mity that the sequence {r;,} does not converge sg also 2° yi 
Tk + ai; for some i and j and a;; ~ 0, rig 4TH +f | ' 
be the first trial whan, -.\”*!, implies the eventual ruin of one plays ise 
wally ruined oY One of the players is ruined. Now noting ote ly 
Rew SR ayy Mt PlN) < o(0) = 0 and if 2 is even™ 

and y(rvy) < (R + M) = 1, we get 
s 


£0~x Prob {1 is ruined | ro] + 1X Prob [23 


Prob) [2 is ruined | ro] 2 ¢(ro)- 


A8.4] 


Summary. If ¢ is a solutior 
[olr + aij)| whenever his cur 
play terminates with probabilit 
less than g(ro). Asimilar anal 
shows that, if player 2 plays mi 
y, then, regardless of 1’s stra th 
probability that 2 will be ruined i 
function y depends upon the value of th 5 PAL RE ASAE 
should write ¢.(ro) instead of merely (ro). The value to player 1 of the 
game is therefore 


and this is independent of Px. 

To show that there is a strictly monotonic solution to B and E, as we assumed 
earlier on p. 473, is a major feat in itself. As a step in this demonstration, Milnor 
and Shapley use the fact that player 1’s survival game (i.e., 1 loses only if 1 is 
ruined, i.e., Po = 1) has a value. This was demonstrated by Scarf and Shapley 


[1954] in a very abstract paper, which in turn depends upon work of Glicksberg 
[1950]. 


Milnor and Shapley [1955] go on to show that each of the players actually has an 
optimal strategy which forces an end to the play with probability 1, and that, 
therefore, such a strategy is uniformly optimal for all P.. c 


Milnor and Shapley’s results, although proved constructively, would be 
terribly difficult for a player to use, so a simple approximation to the solu- 
tion is desirable. They give one which is quite good provided that max 


\a;;| is small compared with the player’s initial fortunes, ro and R — 71. 
Let 


or — ‘ if A) € 0, 
e*(r) = Ke 1)/Xo pee 


if Ao = 0, 


where Xo is the unique?’ solution to 


Pat! = 1 ertin — 1 
ON mn 
val| | = 0. 
ehemi pas 4 i oe — 1 
Ke r 


Now, if at each trial of the given survival game players 1 and 2 use maxi- 
min and minimax strategies of the game [y*(a;;)], then the game termi- 
Nates with probability 1. If —m = min 4j; and M = max a;;, then 


a9 a7 
Player 2’s probability of being ruined is at least e*(ro)/e*(R + M) and at 


* The solution can be proved both to exist and to be unique. 
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+ m)/e*(R + m). Therefore, if tes the yal 
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. ve the bot 

Be anil game to player 1, we ha 
tne F 
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y*(ro) € o(r0) < ae 

o*(R + M) o*( 
increase in precision as m and nade smaller re), 


which, of course, 
tive to ro and R. 


First, g*(r) is clearly strictly monotonical, 
he proof follows. ) all 
pA sketch of t 


i i it satisfies eq. B since 
increasing. Second, it sa q 


portaii) — 1 
val [p*(r + aij)] = val es, 5 | 
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lat 
| 
SS 
Y 
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a ¢* (r), 


using the definition of g* and the fact that Ao is the solution to val [es i) ‘ 
=0. Third, the maximin and minimax strategies for the rah ly*tr ia 
are completely independent of r for, by what we have just seen, |¢ (r+ a i 
[¢*(a;;)] are strategically equivalent, i.e., differ only by a linear seat a 
the entries, and the latter does not depend upon. Now, if at each tn yi 
his maximin strategy for the game [y*(a;;)], then the sequence {o*(n 


ae sae : ; with probabil 
realization of a semimartingale stochastic process which converges with prob! 
one. But in that case, as we saw before, 


e*(ro) < E[lim g*(r * < o*(0) Prob /1 i ined | rol ‘ | 
k>« ) le*(ra)) < @*(0) Ren, Prob [2 is ruined | 
Since y*(0) = 0, we conclude 


* 
Prob. [2 is ruined | ro] > —? 
ruined | ro] 2 e*(R + M) 4 
The analysis for player 2 is similar. 


A8.5 
MULTICOMPONENT ATTRITION GAMES 
A whimsical : ich 
< oe Instance of q multicomponent attrition game, - tt? 
Class of gam ite Cats versus men and mice,” is due; 4 8 = eal! h 
€s, to m i) 
which initi : lackwell [1954 a]. In each component on woo! 


all Cc C 0 e 
i le Onsists of a) women and a§° cats, puts eith | gos 
j Ting without any knowledge of what team 3 


ee oO 


v 
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rule: a woman eliminates a mar 


mouse, Who in turn, eliminates a w 


Woman 


| T 
Team 1 i / 
Cat 1 loses Team 2 lose 


a Cat a mouse |. 


The overall game is one of attrition in the sense that the component games 
are repeated until one side is decimated, which it really is, of course, when- 
ever one of its two components is reduced to zero. Clearly, each team’s 
mixed strategy at each engagement should depend upon the current 
resources of both teams. 

Blackwell’s general class of games is a fairly straightforward generaliza- 
tion of the example. Player (or team) 1 has R different types of com- 
modities with an initial supply of a‘® units of type r, r = 1,2, ---, 2. 
Player (or team) 2 has S different types of commodities with an initial 
supply of 66” units of type s,s = 1,2, - - - ,S. It will be convenient to 
— the initial R-tuple (a6, af, - + +, a) by a, and, similarly, 
2’s initial S-tuple by b‘°’. We assume that the players have m and n pure 
strategies, respectively, and that the effect of each play of the game is to 
reduce their current supply of the commodities. This reduction is given 
by the attrition matrices [a,(i, j)), where the typical entry is the amount 
that player 1’s rth commodity is diminished when strategies ¢ and ) are 
used, and [8,(i, j)], where the typical entry is the amount that player 
2’s sth commodity is diminished when strategies ¢ andj are used. Thus, if 
the strategy pairs (i;, j1), (é2, j2), * * * » (tk) Je) are used during the first & 
trials, the remaining amounts of resources are: 


k 
> a@;(ig, jg), if this number is positive, 


q=1 


a®) = gi) — 


= 0, otherwise, 
and 


k 
B®) = pO — > B.(igs jq)s if this number is positive, 
q=1 
= 0, otherwise. 


It is assumed that each player tries to exhaust one—any one—of his 
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adversar 


vanish. 


i fe Can Sé te 
This is a recursive game, as WE Cé fs 


trial & to be either the game with the sig i cater the g 
trial k, provided neither player has a é his commy 

or 1 if any of player 2s stocks go to zero of 2s stony» 
zero and at least one of 1’s do. (Obsery nnvention i 

made that player 1 “‘wins” whenever both imultaneously exh 
acommodity.) The play terminates when eit! Oora 1 payoff co 
To guard against infinite play, Blackwell requires that at least one ¢ 


modity be diminished in each engagement and that no resupply ¢ 
occurs. Stated formally, for every (i, j) pair, 


R S 
¥ a-(2, J) + )) B.(%, j) > 0 
r=1 s=1 
and 
a,(i,j) 2 9, for every 7, 
and 


B.(i, 7) 2 0, for every S. 


In sum, a multicomponent attrition game is described by two comples 
of information: the initial resources (a‘”’, b), and the attrition mal 
[(a(z, Ds BC, |, which we shall abbreviate simply as (a, 6). Each - 
a the matrix (@, 6) is an (R + S)-tuple, the first R components at? 
@ j) entry [ox(i, j), - - - , ag(i, j)], being designated by w(i, i) a8 
last S components, [8;(i, j), - - - , Bs(é, j)], by Bi, j)- Since he P 
have been chosen to be 0 and 1, the value of the game 1s merely the p™ 
ability, which we denote by Pla, B; a, p}, that player 1 wins « 
cng “sg optimal strategies. Since multicomponent", 
tated pecia _Cases of recursive games, we know that, 0 

as a function of (a, b), with (a, 8) held constant 


Satisf P ; il 
Of : a basic functional equation of stochastic and recursive 

/ i i > ; us, ° 
Bl s©; €ven in special instances, that equation 1s monsttor” 
ackwell does not = ’ sig? 


ttempt to solve i Rather, he iV" 
the asymptotic behavi t as such. “ e incr 
indefinite] penavior of P as the resources (a ,b ) ate it 
For Guae to the condition that their relative oe if 

» One can loo 0) 4,(0)) pairs such ™ 
k for the set of (a‘, b‘"’) paits 


lim Pla, 8; ta, th] ” i. 


tow 
where, of yb 
» OF Course, ta — (4q(0) - ilatly 1 
= (0 0 yarn sho 
For the women ie (ta{, ta, - - - , ta) and sims sp? 


ats versus men and mice example, 
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that 


’ (0, 0; 1, 0) G20!:0°0 
lim P (0, 1;0,0) (0, 0; 0, 1 


to 


provided that 


Note, for example, that (0, 0; 1, 0), which is the (1, 1 
(a, 8), has the interpretation: team 1 loses zero won 
team 2 loses one man and zero mice. 


1en and zero cats and 


A8.6 APPROACHABILITY-EXCLUDABILITY THEORY AND 
COMPOUND DECISION PROBLEMS 


The asymptotic theory of multicomponent attrition games is based on 
Blackwell’s [1956 a] analogue of the minimax theorem for games with 


Fic. 2 


vector payoffs. In such games, the players have m and n pure strategies as 
usual, but the payoff corresponding to the (i, /) strategy pair is a Q-tuple 
(or vector in Q-space) of the form c(7,7) = [e1(?, J), cat, Js °° “5 cQ(t,7))- 
The multicomponent attrition games are of this form with Q=R+ Sand 
c(i, j) equal to the attrition payoff, but, as we shall see below, quite differ- 
ent interpretations of vector games also exist. 

Let us denote by C the convex hull of the set of points (in Q-space) 
c(i, }), where i and j vary over their domains. For example, if Q = 2, 
m= 2, and n = 3, then a typical region C is shown in Fig. 2. Blackwell 
Taises this question: If such a game is repeated in time, can player 1 force the 
average payoff to approach a preassigned closed subset T of C? Equally well, when 
‘an player 2 exclude the average payoff from T? 

The following notation will be useful. Let x = (x1, x2, °°." » Xm) be 
ne of player 1’s mixed strategies on a component game, then if player 2 


ee 


ntial Compounding of I 


4g0 Seque 
the expected Pa) 
ye ure strategy = rk 
uses P s 
ex. 7 — 


Thus, his expected payoff when he u 
points ¢(x; J = 


containing the ” ag 
s:*)- Exactly parallel notation Ly; ¢ 
player 2. Finally, the average P2)* ff 


® = CGer + C\ia, )2 = 


where (ia; Ja) denotes the strategy pair 

We observe that a sufficient conaiton fo! 
is the existence of astrategy y 9 such that * Hf 
y°” is used at each trial the average payoff will approach C(-, y 
not T. Blackwell shows, in essence, that this is 


7 


To be more precise: any convex set 1 is etter a, 
and the latter is equivalent to the existence or a y 
disjoint. Furthermore, he displays a strategy for | r 1 whi 
the average payoff to approach 7 whenever such a strategy exists 

The idea issimple. If at trial 4, the average payoff ¢™ ss alrext 
select any x on trialk + 1. If, however, ¢ E) and T are disjoint, chow 
so that C(x, -) and T lie on the same side of the supporting hyper? 
which both passes through the point ec’ of 7 that is 


is perpendicular to the line joining these two points. (See 


pons ‘ see PIs J 
sa x can be shown to exist if and only if the convex set Tis notes J 
(Roughly the idea is this: Suppose 1 tries to get an expected 
w : - 
ich lies as far below the separating hyperplane as poss! 
ha } = ‘ ey ; —_— 
a guarantee that 1 will not get a point on or below this NP° 
since . : + , 
mini ‘ y) intersects T for all y. Now we invoke the us 
below Saleen to conclude that 1 can therefore guarantee" 
e Vv = : a atl } 
will sok yperplane.) Since the expected payoff * & ° 
» Of Course, be in C(x, -), 1 = a 
argument by : > *), let us, for heuristic reas os 
; Y supposing that th . oe. heer 
c* in C( 5 e actual payoff on triai 
=(k) x, *); then the aver d 
c and ¢*. 


ght for we have 


ODS, 


he [ILS 


. the arg 
As yet, however, tHe “s 


$ dealing with expected values at > 
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Two points about approacha! 
tion: why is it related to the stud 
in what sense is it an analogue oi t 
be a problem since we know 
games, whereas the present theory 
Blackwell confined himself to qué 
initial resources are held in fixed proportion and 1 SE SRG Bet" 8 
It is thus plausible that each player’s ability to control the Hmiting behav- 
ior of the time average of the attrition payoffs will govern the outcome, 
and in fact it does. 

Next, let us turn to the sense in which the theory generalizes the mini- 
max theorem. Suppose that the payoffs c(i, j) are actually real numbers, 


Hyperplane through ec’ perpendicular 
to line from ¢ to e’ 
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ie, Q=1, and that they are interpreted as 1’s payoffs. If we let a 
denote the minimum and } the maximum of these mn numbers, the set C 
is simply the interval of the real line from a to 6 inclusive. If v denotes 
— of the game, player 1 can approach the interval [v, 6] and player 
"ti pen the interval [a, v]. Or in more familiar words, using the 
Lie ni numbers, the expected value v of a two-person zero-sum game 
given a frequency interpretation as the limiting value of a temporal 

average, 
ee promised a second and important interpretation of the ap- 
atin ey theory, and it is now time to fulfill it. Let us 
solely a ee aa game is to be repeated and that player 1 is 
.., eel in his long-term average payoff. He can certainly 
tert game ie average at least equal to the maximum value of the com- 
y playing maximin at each stage. But, as we have pointed 
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it has long been | 


f the following cases 


ecognized t} 
out previously; 
realistic in any 0 


rhe layer 2 is n 
Ase rame when pie 

= zero-sum &¢ 

ts In aZ 


4, In a non-zero-sulm game, 
‘i When player 2 is “nature” In 
My : . . . . te Se by] Fie 
satistical inference problem 
‘nty—the statistical 11 
uncertainty 


the ust 


Robbins [1951] has emphasized that when a ical) decision 
Jem is repeated in time, e.g., when a stream of individuals must be cag 
by their individual test responses, the statistician can often do ay, 


asymptotically with no prior information as when he knows the 
limiting proportion of times player 2 uses cach strategy. To be y 
specific, suppose 1’s payoffs are a;; and that a pr tort he knows that they 
portion of the time player 2 will use strategy ), ) ie 
He can, therefore, achieve the limiting average return 


6" 
ply *) = max () Qi 5); ') 
i 
j 


by playing that strategy 7 which maximizes the right-hand expression 

each trial. Hannan [1957] shows that asymptotically player 1 can doi 
well as p(y*) without knowing y * beforehand provided that he bases his choi 
at each trial on his knowledge of 2’s previous choices and on chat 

(Actually, he need only consider 2’s empirical mixed strategy over | 

preceding moves.) 


Wey? 
yd} 


: Blackwell [1956 b] shows that this can be concluded from approach 
ity-excludability theory. He chooses Q=n-+1 and defines 


Seon0, > , 0, 1,0, - - « , 0, ai) 
where the 1 appears in the 
8iven game to player 1. 

when one observes that 
empirical mixed 
1’s avera 
(n + 1) 


Call it 


jth position and a;; is the (i, j) payoll a 
This definition may seem strange, but" oe 1s 
the first n components of @“ equal ub i 
strategy over the first & trials and the last compo” 
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approaches a limit ask — ©. When the limit does not exist, the re 
interpreted roughly as meaning that the average payoff for lar 

close to p(y”). 
The approachability of T follows from the observation that, for each y, 
the set C(-, y) just touches T. This we can see as follows: If y = (y1, 72s 
- , yn), then C(-, y) is the set of (n + 1)-tuples (v1, ye 
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The choice of a strategy which leads the average payoff to approach T 
‘s far more subtle than it may seem. For example, player 1’s “obvious” 
strategy of playing optimal on trial & + 1 against 2’s empirical mixed 
strategy calculated over the first é trials need not force the average payoff 
to converge to T. Remember that player 2 may not employ the limiting 
mixed strategy y* at every (or indeed, any) of the trials. 

Besides this asymptotic result, Hannan [1957] also has a great deal to 
say about the rates of convergence for certain reasonable classes of player 
1’s strategies. Other papers which extend the pioneering work of 
Robbins [1951] on compound statistical decision problems are Hannan 
and Robbins [1955], Laderman [1955], and Johns [1956]. 
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A8.7 DIVIDEND POLICY AND ECONOMIC RUIN GAMES 


Most of the games we have encountered in this appendix meet the 
following very general description: a known stochastic process is under 
way, but at periodic intervals two players, perhaps opposing, can exert 
some influence on the process. Shubik [1957] has pointed out that 
corporate dividend policy can be looked upon in this way, and he has 
begun to examine games suggested by this interpretation. 

The simplest case is the degenerate single corporation game in which its 
assets fluctuate from period to period according to a simple chance 
mechanism. For example, if the capital accumulation is Z units (units 
in terms of thousands or tens of thousands of dollars) in one period, we 
might assume that in the next period it becomes Z +1 with probability 
pf, or Z — 1 with probability g = 1 — P- The corporation is ruined if at 
any period its capital drops below zero. Clearly, its chance of being 
ruined within a specified time period is less the greater the capital at the 
beginning of that period, but, on the other hand, money in the corporate 
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As stated, the problem is naively simple, but we can complicate it; 
variety of ways. First, it can be made a one-person game against na 
by supposing that p is unknown. Of course, as the random walk unfd 
data will be accumulated and inferences about / can be made. Butwi 
about dividend policy in the early stages? 

Second, another corporation may be assumed, also with pres! 
initial assets, and the two play a competing survival game. Thatis! 
any trial corporation 1 plays 7 and corporation 2 plays j, then their as 
are changed by a;; and b;; units, respectively. This survival game, 4! 
likely with a non-zero-sum component game, proceeds as usual, but! 
the simpler case both corporations must worry about dividend polit 
The problem is not completely formulated until it is known what happt" 
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sion possibilities are enormous. 
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